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TRANSGENIC ANIIWALS PRODUCED BY 
HOMOLOGOUS SEQUENCE TARGETING 



FPt nOFTHF INVENTION 

The inventton relates to methods for targeting an exogenous polynudeotide or exogenous 
5 complementary polynucleotide pair to a predetennined endogenous DNA target sequence in a target 
cell by homologous pairing, particularly for altering an endogenous DNA sequence, such as a 
chromosomal DNA sequence, typically by targeted homologous recombination. In certain 
embodiments, the invention relates to methods for targeting an exogenous polynucleotide having a 
linked chemical substituent to a predetemiined endogenous DNA sequence in a nretabolically active 
1 0 target cell, generating a DNA sequence-specific targeting of one or more chemical substituents in an 
intact nucleus of a metabolically active living target cell, generally for purposes of altering a 
predetermined endogenous DNA sequence in the cell. The invention also relates to compositions and 
formulations that contain exogenous targeting polynucleotides, complementa^ pairs of exogenous 
targeting polynucleotides, chemical substituents of such polynucteotides. and recombinase proteins, 
15 including recombinosome proteins and other targeting proteins, used In the methods of the invention. 

BACKGROUND 

Homologous recombination (or general recombination) is defined as the exchange of homologous 
segments anywhere along a length of two DNA molecules. An essential feature of general 
recombination is that the enzymes responsible for the recombination event can presumably use any 
20 pair of homologous sequences as substrates, although some types of sequence may be favored over 
others. Both genetic and cytological studies have indicated that such a crossing-over process occurs 
between pairs of homologous chromosomes during meiosis in higher organisms. 

Alternatively, in site-specific recombination, exchange occurs at a specific site, as In the Integration of 
phage 8 into the £. coff chromosome and the excision of 8 DNA from it. Site-specific recombination 
25 involves specific sequences of the phage DNA and bacterial DNA. Wrthin these sequences there is 
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25 



30 



only a short stretch of homology necessary for the recombination event but not sufficient for it The 
enzymes involved in this event generally cannot recombine other pairs of homologous (or 
nonhomologous) sequences, but act specifically on the particular phage and bacterial sequences. 

Although both site-spedfic recombination and homologous recombination are useful mechanisms for 
genetic engineering of DNA sequences, targeted homologous recombination provides a basis for 
targeting and altering essentially any desired sequence in a duplex DNA molecule, such as targeting a 
DNA sequence in a chromosome for replacement by another sequence. Site-spedfic recombination 
hag been proposed as one method to integrate transfected DNA at chromosomal locations having 
specific recognition sites (O'Gomian et al. (1991) S£isaS2S251: 1351; OnouchI et al. (1991) tia^ 
Asid^Ess. m. 6373). Unfortunately, since this approach requires the presence of specific target 
sequences and recombinases. its utility for targeting recombination evente at any particular 
chromosomal location is severely limited in comparison to targeted general recombination. 

Forthese reasons and others, targeted homologous recombination has been proposed for treating 
human genetic diseases. Human genetic diseases include (1) classical human genetic diseases 
wherein a disease allele having a mutant genetic lesion Is inherited from a parent (e.g.. adenosine 
deaminase defidency. sidde cell anemia, thalassemias). (2) complex genetic diseases like cancer 
where the pathological state generally results from one or more spedfic inherited or acquired 
mutations, and (3) acquired genetic disease, sudi as an integrated provirus (e.g.. hepatftis B virus) 
.However, current methods of targeted homologous recombination are ineffident and produce desired 
homologous recombinants only rarely, necessitating complex cell selection schemes to identily and 
isolate con'ectly targeted recombinants. 

A primary step in homologous recombination is DNA strand exchange, which involves a pairing of a 
DNA duplex with at least one DNA strand containing a complementaor sequence to fomi an 
intermediate recombination structure containing heteroduplex DNA (sfis. Radding. CM. (1982) Ann 
BgSLiSetieL 16: 405: U.S. Patent 4.888.274). The heteroduplex DNA may take several forms, 
induding a three DNA strand containing triplex fbm, wherein a single complementary strand ir^vades 
the DNA duplex (Hsieh et al. (1990) Cengs and Developmfnt 4: 1951; Rao et al.. (1991) PNAS 
88:2984)) and, when two complementary DNA strands pair with a DNA duplex, a classical Holliday 
recombination joint or chi strudure (Holliday, R. (1 964) SeneLEss. §: 282) may form, or a double-D 
toop ("Diagnostic Applications of Double-D Loop Formation" U.S.S.N. 07/755,462, filed 4 September 
1991. whid, is incorporated herein by reference). Once fomied. a heteroduplex structure may be 
resolved by strend breakage and exchange, so that all or a portion of an invading DNA strand is 
spliced into a recipient DNA duplex, adding or repladng a segment of the recipient DNA duplex. 
Alternatively, a heteroduplex stmcture may result in gene conversion, wherein a sequence of an 
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invading strand is transfen-ed to a recipient DNA duplex by repair of mismatched bases using the 
invading strand as a template (Genes . 3rd Ed. (1987) Lewin. B.. John Wiley. New York, NY; Lopez et 
ai. (iQft7) fOimieic Acids Res. 15: 5643). Whether by the mechanism of breakage and rejoining or by 
the mechanism(s) of gene conversion, formation of heteroduplex DNA at homologously paired joints 
5 can serve to transfer genetic sequence information from one DNA molecule to another. 

The ability of homologous recombination (gene conversion and classical strand breakage/rejoining) to 
fransfer genetic sequence infomnation between DNA molecules makes targeted honrwiogous 
recombination a powerful method in genetic engineering and gene manipulation. 

The ability of mammalian and human cells to incorporate exogenous genetic material into genes 

10 residing on chromosomes has demonstrated that these cells have the general enzymatic machinery 
for carrying out homologous recombination required between resident and introduced sequences. 
These targeted recombination events can be used to correct mutations at known sites, replace genes 
or gene segments with defective ones, or introduce foreign genes into cells. The efficiency of such 
gene targeting techniques is related to several parameters: the efficiency of DNA delivery into cells. 

15 the type of DNA packaging (if any) and the size and confomiation of the incoming DNA, the length and 
position of regions homologous to the target sHe (all these parameters also likely affect the ability of 
the incoming homologous DNA sequences to survive intraceilular nuclease attack), the efficiency of 
recombination at particular chromosomal sites and whether recombinant events are homotogous or 
nonhomologous. Over the past 10 years or so, several methods have been devetoped to Introduce 

20 DNA into mammalian cells: direct needle microinjection, transfection. electroporation, 

electroihcorporation, retroviruses, adenovimses. adeno-associated viruses; Herpes viruses, and other 
viral packaging and delivery systems, polyamidoamine dendimers, liposomes, and most recently 
techniques using DNA-coated microprojectiles delivered with a gene gun (called a biolistics device), or 
narrow-beam lasers (laser-poration). The processes associated with some types of gene transfer 

25 have been shown to be both mutagenic and carcinogenic (Bardwell, (1 989) MMtaqgn??is 4: 245), and 
these possibilities must be considered in choosing a transfection approach. 

The choice of a particular DNA transfection procedure depends upon its availability to the researcher, 
the technique's efficiency with the particular chosen target cell type, and the resear^ers concems 
about the potential for generating unwanted genome mutations. For example, retroviral integration 
30 requires dividing cells, always results in nonhomologous recombination events, and retroviral insertion 
within a coding sequence of nonhomologous (i.e.. non-targeted) gene could cause cell mutation, by 
inactivating the gene's coding sequence (Friedmann. (1989) ggigripe 244.1275). Newer retroviral- 
based DNA delivery systems are being developed using defective retroviruses. However, these 
disabled vimses must be packaged using helper systems, are often obtained at low titer, and 
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recombination is still not site-specific, thus recombination between endogenous cellular retrovirus 
secjuences and disabled vims sequences could sUII produce wild-type retrovinis capable of causing 
gene mutation. Adeno- or polyoma virus based delivery systems appear very promising (Samulski et 
al.. (1991)IMBai Jfl: 2941; Gareis et al.. (1991) Cell. Molec. Biol ^j- 191; RosenfeW etal. (1992) 
5 eeU fifi: 143) although they still require specific cell membrane recognition and binding characteristics 
for target cell entry. Liposomes often show a narrow spedwm of cell specificities, and when DNA is 
coated externally on to them, the DNA is often sensitive to cellular nucleases. Newer polycationic 
lipospermines compounds exhibit broad cell ranges (Behr et al.. (19891 Proc. Natl. Arari R^j f ig^ac- 
6982) and DNA is coated by these compounds. In addition, a combination of neutral and cationic lipid 

10 has been shown to be highly efficient at transfection of animal cells and showed a broad spectrum of 
effectiveness in a variety of cell lines (Rose etal.. (1991) fiiQlgdmiflUM 10:520). Galactosylated bis- 
acridine has also been described as a carrier for delivery of polynucleotides to liver cells (Haensler JL 
and SzokaFC (1992). Abstract V211 in J^^slLBiosbgoL Supplement 16F, April 3-16. 1992. 
Incorporated herein by reference). Electnoporation also appears to be applicable to most cell types. 

15 The efficiency of this procedure for a specific gene is variable and can range from about one event per 

3 X 10- transfected cells (Thomas and Capecchi. (1987) CfiD 51: 503) to between one In 10^ and 10» 
cells receiving the exogenous DNA (Koller and Smithies, (1989) Proc. NaM. Acad. Sei niRA^} gg; 
8932). Microinjection of exogenous DNA into the nucleus has been reported to result in a high 
frequency of stable transfected cells. Zimmer and Gruss (Zimmer and Gmss (1989) Nature 33g: 150) 
20 have reported that for the mouse bssOA gene. 1 per 150 microinjected cells showed a stable 
homologous site specific alteration. 

Several methods have been developed to detect and/or select for targeted site-specific recombinants 
between vector DNA and the target homologous chromosomal sequence (stfi. Capeechi. (1989) 
Scisn^ 244: 1288 for review). Cells which exhibit a specific phenotype after site-specific 

25 recombination, such as occurs with alteration of the bm gene, can be obtained by direct selection on 
the appropriate growth medium. Alternatively, a selective mariner sequence such as dss can be 
incorporated into a vector under promoter control, and successful transfection can be scored by 
selecting G418' cells followed by PCR to determine whettierneo is at the targeted site (Joyner et al.. 
(1989) HalUie 33fi: 153). A positive-negative selection (PNS) procedure using both flge and HSV-fls 

30 genes allows selection for transfectants and against nonhomologous recombination events, and 
significantiy enriched for desired disruption events at several different mouse genes (Mansour et al.. 
(1988) liitijriaaS: 348). This procedure has ttie advantage that the method does not require Uiat the 
targeted gene be transcribed. If the targeted gene is transcribed, a promoter-less mari<er gene can be 

incorporated into tiie targeting construct so that the gene becomes activated after homologous 
35 recombination With the target site (Jasin and Berg. (1988) Genes and n^v»i»nmoht g: 1353; 

Doetschman et al. (1988) Pfoc. Natl. Apart Sri (USA) fig: 8583; Dorini et al.. (1989) SsiSD^ 242: 
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1357; Itzhaki and Porter! (iQQi) N..ri Acids Res. 19: 3835). Recombinant products produced using 
vectors with selectable markers often continue to retain these markers as foreign genetic material at 
the site of transfection. although loss does occur. Valancius and Smithies (Valancius and Smithies. 
(1991) M"ipr. Cellular Biol. H: 1402) have described an "in-out" targeting procedure that allowed a 

5 subtle 4-bp insertion modification of a mouse m target gene. The resulting transfectant contained 
only the desired modified gene sequence and no selectable marker remained after the "ouT 
recombination step. Cotransfbrmationof cells virilh two different vectors, one vector contained a 
selectable gene and the other used for gene disruption, Increases the efficiency of isolating a specific 
targeting reaction (Reid et al.. (1991) MolSQ, CelMgrPlPLll: 2769) among selected cells that are 

1 0 subsequently scored for stable recombinants. 

Unfortunately, exogenous sequences transferred into eukaryotic cells undergo homologous 
recombination with homologous endogenous sequences only at very low frequencies, and are so 
inefficiently recombined that large numbers of cells must be transfected. selected, and screened in 
order to generate a desired correctly targeted homologous recombinant (Kucheriapati et al. (1984) 
1 5 Dro>- Mot. Ar.ari Sci fU.S.A.) fil: 3153; Smithies. 0. (1985) mm 212: 230; Song et al. (1987) ECSSL 
Ma ^i frrari sci. ru.S.A.) 84: 6820; Doetschman etal. (1987)!!lsJii£Bm 576; Kim and Smithies 
(1988) hi..riPir Acids Res. 16: 8887; Doetschman etal. (1988) oariL: Koller and Smithies (1989) 
oasaL; Shesely et al. (^°°-)Pr^ fu^" A«.ri Sri m S A.^ fifi: 4294; Kim et al. (1991) SSfiDS m 227. 
which are incorporated herein by reference). 

20 Koller et al. (1991) op^; m^i Sd m.S.A.V fifi: 10730 and Snouwaert et al. (1992) SfflfiDSg 252: 
1083. have described targetihg of the mouse cystic fibrosis transmembrane regulator (Cm^) gene for 
the purpose of Inactivating, rather than correcting, a murine CFTR allele. Koller et al. employed a 
large (7.8kb) homology region in the doubie-stranded DNA targeting construct, but nonetheless 
reported a low frequency for correct targeting (only 1 of 2500 G418-resistant cells were correctly 

25 targeted). Thus, even targeting constructs having lone homology regions are inefficiently targeted. 

Several proteins or purified extracts having the property of promoting homologous recombination (i.e.. 
recombinase activity) have been identified in prokaryotes and eukaryotes (Cox and Lehman (1987) 
Ann r^^v Biochem. 56: 229; Redding. CM. (1982) QBS^. Madiraju et al. (1988) PpTff, Natl.AWtf.Scl 
m s.A.) 25: 6592; McCarthy et al. (1988) pr>v> A«,rt Sd. m.S.A.) fifi: 5854; Lopez et al. (1987) 
30 QB^,which are incorporated herein by reference). These general recombinases presumably 

promote one or more steps In the formation of honrwiogously-paired Intermediates, strand-exchange, 
gerie conversion, and/or other steps in the process of homologous recombination. 
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The frequency of homologous recombination In prokaryotes is significantly enhanced by the presence 
of recombinase activities. Several purified proteins catalyze homologous pairing and/or strand 
exchange in Jdlifi. including: E. coli recA protein, the T4 uvsX protein, and the red protein from 
Ustilago maydis. Recbmbinases. like the recA protein of £. coli are proteins which promote strand 
pairing and exchange. The most studied recombinase to date has been the recA recombinase of £ 
coli. which IS involved In homology search and strand exchange reactions (see. Cox and Lehman 
(1987) maU. RecA is required fbr lnduction of the SOS repair response. DNA repair, and efficient 
genetic recombination in £ coli. RecA can catalyze homologous pairing of a linear duplex DNA and a 
homologous single strand DNA iajdtts. In contrast to site-specific recomblnases. proteins like 
which are involved in general recombination recognize and promote pairing of DNA structures on the 
basis of shared homology, as has been shown by several injdtia experiments (Hsieh and Camerini- 
Otero (1989) XJioLChm 264: 5089; Howard-Flanders et al. (1984) ito 222: 215; Stasiak et al. 
(1984) Cold Spring HartxirSymp Qiisnt Riol 49: 561; Register et al. (1987) J. Biol. Cham 9fi9- 
12812). Several im/estigators have used recA protein in vjire to promote homologously paired tnplex 
DNA (Cheng et al. (1988) J.BioLChem.m 15110; Ferrin and Carney 

1494: Ramdasetal. (m9),Lm£bm.m 11395; Strobeletal. (1991)§signse254: 1639- Hsieh 
et al. (1990) siLfiiL: Rigas et al. (1986) Pres. Natl Aoari Sg f 1 1 M ) M: 9591; and Camerini-Otero et 
al. U.S. 7.61 1.268 (available from Derwent). whk:h are incorporated herein by reference). 
Unfortunately many important genetto engineering manipulations involving homotogous recombination 
such as using homologous recombination to alter endogenous DNA sequences in a Mng cell, cannot " 
be done in vitro. Further, gene therapy requires highly efficient homologous recombination of targeting 
vectors with predetermined endogenous target sequences, since selectable mariner selection 
schemes, such as those currently available in the art are not usually practicable in human beings. 

Thus, there exists a need in the art for methods of efficiently altering predetermined endogenous 
genetfc sequences by homotogous pairing and homologous recombination in ybffi by introducing one 
or more exogenous targeting polynucleotlde(s) that efficiently and specifically homologously pair with a 
predetemiined endogenous DNA sequence. There exists a need In the art for high-effidency gene 
targeting, so as to avoid complex in yitre selection protocols (e.g.. neq gene selection with G418). 
which are of limited utility for in jdjaj gene therapy on affected indivkiuals 

SUMMARY OF THE INVENTIOM 

It is an object of the present Invention to provide methods for targeting an exogenous polynucleotide to 
a predetemiined endogenous DNA target sequence In a target cell with high efficiency and with 
sequence specificity. Exogenous polynucleotides, are localized (or targeted) to one or more 
predetemiined DNA target sequence(s) by homologous pairing iQyjjffi. Such targeted homotogous 
35 pairing of exogenous polynucleotides to endogenous DNA sequences io yjvo may be used: (1) to 
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target chemical substituents in a sequence-specific manner in yjyfi. (2) to correct or to generate 
genetic mutations in endogenous DNA sequences by homologous recombination and/or gene 
conversion. (3) to produce homologously targeted transgenic animals and plants at high efficiency, 
and (4) in other applications (e.g., targeted drug delivery) based on in yjyo homologous pairing. Some 
5 embodiments of the invention employ targeted exogenous polynucleotides to correct endogenous 
mutant gene alleles in human cells; the invention provides mettiods and compositions for correcting 
disease alleles involved in producing human genetic diseases, such as inherited genetic diseases 
(e.g.. cystic fibrosis) and neoplasia (e.g., neoplasms induced by somatic mutation of an oncogene or 
tumor suppressor gene, such as p53, or viral genes associated Wrth neoplasia, such as HBV genes). 

10 In one embodiment at least one exogenous polynucleotide is targeted to a predetemnined 

endogenous DNA sequence and alters the endogenous DNA sequence, such as a chromosomal DNA 
sequence, typically by targeted homologous recombination within and/or flanking the predetermined . 
endogenous DNA sequence. Generally, two complementary exogenous polynucleotides are used for 
targeting an endogenous DNA sequence. Typically, the targeting polynucleotide(s) are introduced 

1 5 simultaneously or contemporaneously with one or more recombinase species. Alternatively, one or 
more recombinase species may be induced or produced in Jdyfl. for example by expression of a 
heterologous expression cassette in a cell containing the preselected target DNA sequence. 

It is another object of the invention to provide methods whereby at least one exogenous polynucleotide 
containing a chemical substituent can be targeted to a predetermined endogenous DNA sequence in a 

20 metabolically-active or intact living target cell, permitting sequence-specific targeting of chemical 

substituents such as. for example cross-linking agents, metal chelates (e.g., iron/EDTA chelate for iron 
catalyzed cleavage), topoisomerases, endonucleases, exonucleases. ligases, phosphodiesterases, 
photodynamic porphyrins, free-radical generating drugs, chemptherapeutic drugs (e.g.. adriamycin, 
doxirubicin). intercalating agents, base-modification agents, immunoglobulin chains, oligonucleotides. 

25 and otiier substituents. The methods of the invention can be used to target such a chemical 

substituent to a predetermined DNA sequence by homologous pairing for various applications, for 
example: producing sequence-specific sti^nd scission(s), producing sequence-specific chemical 
modifications (e.g.. base methylation. strand cross-linking), producing sequence-spedfic localization of 
polypeptides (e.g.. topoisomerases. helicases. proteases), producing sequence-specific localization of 

30 polynucleotides (e.g.. loading sites for transcription factors and/or RNA polymerase), and other 
applications. 

It is another object of the present invention to provide methods for correcting a genetic mutation in an 
endogenous DNA target sequence, such as a sequence encoding an RNA or a protein. For example, 
the invention can be used to correct genetic mutations, such as base substitutions, additions, and/or 
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deletions, by converting a mutant DNA sequence that encodes a non-functional. dysfunctional, and/or 
truncated polypeptide into a corrected DNA sequence that encodes a functional polypeptide (e g has 
a biological activity such as an enzymatic activity, hormone function, or other biological property) ' The 
methods and compositions of the invention may also be used to correct genetic mutations or 
dysfunctional alleles with genetic lesions in nonK»ding sequences (e.g.. promoters, enhancers 
silencers, origins of replication, splicing signals). In contradistinction, the invention also can be used to 
target DNA sequences for Inactivating gene expression; a targeting polynucleotide can be employed to 
make a targeted base substitution, addition, and/or deletion in a structural or regulatory endogenous 
DNA sequence to alter expression of one or more genes, typically by knocking out at least one allele 
of a gene (i.e.. making a mutant, nonfunctional allele). The invention can also be used to correct 
disease alleles, such as a human or non-human animal CFTR gene allele associated with cystic 
fibrosis, by producing a targeted alteratton in the disease allele to correct a diseaseK:ausing lesion 
(e.g.. a deletion). 



»'^^'"*^^°''i«'=t°^theinventiontop,ovklemethodsandcompositionsfb^ 
1 5 targeting of human genetic disease alteles. such as a CFTR allele associated with cystic fibrosis or an 
LDL receptor allele associated with familial hypercholesterolemia. In one aspect of the invention 
targeting polynucleotides having at least one associated recombinase are targeted to cells in jdj^a (I e 
in an intact animal) by exploiting the advantages of a receptor-mediated uptake mechanism such as " 
an asyoglycoprotein receptor-mediated uptake process. In this variation, a targeting polynucleotide is 
20 associated with a recombinase and a cell-uptake component which enhances the uptake of the 
targeting polynucleotide- recombinase into cells of at least one cell type in an intact individual For 
example, but not limitation, a cell-uptake component typically consists of: (I) a galactose-terminal 
(asialo-) glycoprotein (e.g.. aslaloorosomucoW) capable of being recognized and internalized by 
specialized receptors (asialoglycoprotein receptor) on hepatocytes in m. and (2) a polycation. such 

25 as poly-L-lysine. which binds to the targeting polynucleotide, usually by electrostatic interaction 
Typically, the targeting polynucleotide is coated with recombinase and cell^jptake component 
simultaneously so that both recombinase and cell-uptake component bind to the targeting 
polynucleotide: alternatively, a targeting polynucleotide can be coated with recombinase prior to 

^ ;"'^battonwithacell-uptakecomponent:alternatively thete^^^^^^ 

30 *«^''-"Pta'<ecomponentandintnx.ucedintocellscontempo,aneousbr^ 

recombinase {e.g.. by targeted liposomes containing one or more recombinase). * 

The invention also provides methods and composittons for diagnosis, treatment and pr<,phyla)ds of 
genetic diseases of animals, particulariy mammals, wherein a recombinase and a targeting 
polynucleotide are used to produce a targeted sequence modification in a disease allele of an 
35 endogenous gene. The invention may also be used to produce targeted sequence mod^K:ation(s) in a 
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non-human animal, particulariy a non-human mammal such as a mouse, which create(s) a disease 
allele In a non-human animal. Sequence-modified non-human animals harboring such a disease allele 
may provide useful models of human and veterinary disease(s). Altematively, the methods and 
compositions of the invention can t>e used to provide nonhunfwn animals having homologously- 
5 targeted human disease alleles integrated into a non-human genome; such non-human animals may 
provide useful experimental models of human or other animal genetic disease, Including neoplastic 
and other pathogenic diseases. 

It is also an object of the invention to provide methods and compositions for recombinase- enhanced 
positioning of a targeting polynucleotide to a homologous sequence in an endogenous chromosome to 

1 0 form a stable multistrand complex, and thereby alter expression of a predetermined gene sequence by 
interfering with transcription of sequence(s) adjacent to the multistrand complex. Recombinase(s) are 
used to ensure con*ect homologous pairing and formation of a stable multistrand complex, which may 
include a double-D loop structure. For example, a targeting polynucleotide coated with a recombinase 
may homologously pair with an endogenous chromosomal sequence in a structural or regulatory 

1 5 sequence of a gene and form a stable multistrand complex which may: (1 ) constitute a significant 
physical or chemical obstacle to fonmation of or procession of an active transcriptional complex 
comprising at least an RNA polymerase, or (2) alter the local chromatin structure so as to alter the 
transcription rate of gene sequences within about 1 to 500 kilobases of the multistrand complex. 

It is another object of the invention to provide methods and compositions for treating or preventing 
20 acquired human and animal diseases, particularly parasitic or viral diseases, such as human hepatitis 
B virus (HBV) hepatitis, by targeting viral gene sequences with a recombinase-associated targeting 
polynucleotide and thereby inactivating said viral gene sequences and inhibiting viral-induced 
pathology. 

It is a further object of the invention to provide compositions that contain exogenous targeting 
25 polynucleotides, complementary pairs of targeting polynucleotides, chemical substltuents of such 
polynucleotides, and recombinase proteins used in the methods of the invention. Such compositions 
may include a targeting or cell-uptake components to facilitate intracellular uptake of a targeting 
polynucleotide, especially for M ms, gene therapy and gene modification. 

In accordance with the above objects, the present invention provides methods for targeting and 
30 altering, by homologous recombination, a pre-selected target nucleic acid sequence in a cell to make a 
targeted sequence modification. The methods comprise introducing into at least one cell at least one 
recombinase and at least two single-stranded targeting polynucleotides which are substantially 
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complementary to each other and comprise a homology clamp that substantially corresponds to or is 
substantially complementary to a preselected target nucleic acid sequence. 

In an additional aspect, the invention provides compositions for producing targeted modifications of 
target sequences, including disease alleles, comprising two substantially complementary 
single-stranded targeting polynucleotides and at least one recombinase. 

PRIgF PESCRIPTIOW OF THP npi^yi|^ft^ 
Fig. 1. Homologous targeting of licA-coated chromosome 1 alpha-satellite polynucleotides in 
metabolically active cell nuclei. The homologously targeted blotinylated polynucleotides were 
visualized by addition of FITC-avidin followed by washing to remove unbound FITC. Signals were 
visualized using a Zeiss Confocal Laser Scanning Microscope (CLSM-IO) with 488 nm argon laser 
beam illumination for FITC-DNA detection. iQaJefl -focalized FITC-DNA signals in the cell nucleus. 
LSMSdfifi -enhanced image of FITC-DNA signals in the cell nucleus. Uooerrioht - image of FITC- 
DNA signals overlaid on the phase image ofnucleus. Umum - phase image of the center of the 
cell nucleus showing nucleoli. Note: all images except lower right were photographed at the same 
15 focus level (focus unchanged between these photos). 

Figs. 2A. 2B. 2C. 2D. 2E. 2F, 2G. 2H. 21, 2J. 2K. and 2L. RecA protein-mediated native FISH in 
metabolically acth/e cell nuclei. Hep-2 cell nuclei from cells encapsulated in agarose were incubated 
with RecA-coated biotinylated p53 DNA (A-l) or RecA-coated blotinylated chromosome 1 satellite III 
DNA probes (K-L). Panels B-l show FISH signals in digital images from serial CLSM optical sections 
of PrrC^abeled p53 probe DNA incubated in metabolically active Hep-2 nuclei. The phase image of a 
representative nucleous in shown in Panel A and was sectfoned by CLSM. Digital images in Panels 
were serially overiaid upon one another to produce the composite digital image shown in Panel I 
containing alt three FITC labeled p53 FISH signals. The effect of cssDNA probe concentration and 
RecA protein on efficiency of native dsDNA hybridization in metabolically active nuclei Is shown in 
25 Panel J. The percentage of labeled RecA coated or uncoated p53 cssDNA is shown as a function of 
the amount of p53 DNA probe per hybridization reaction. Closed circles show hybridization leactions 
with RecA-coated p53 cssDNA probe, open triangles show control reactions wittiout RecA protein 
coating of p53 cssDNA probe. Panel K shows the FISH digital image in Panel L overiaid onto the 
phase image. 

30 Fig.3.GeneticmapofmammalianexpressionlacZplasmidpMC1lacXpAwiti,an1l base insertion in 
the Xk^ linker site. 
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Fig. 4. Genetic map of the mammalian expression lacZ plasmid pMCIIacpA. with an insertion 
mutation. 

Fig. 5. PGR products and primers from the lacZ {(i-galactosidase) gene sequence. The location of the 
1 1 bp Xba linker is shown. 

5 Fig. 6. Tests for alteration of an insertion mutation in the lacZ gene of a eukaryotic expression vector. 
NIH 3T3 cells were needle microinjected with five types of plasmids: Two plasmids contained a 
wild-type B-galactosidase gene (pMCIIacpa or pSV-B-gal [Promega]); a plasmid with a mutant B-gal 
gene (pMCIIacXpa); pMCIIacXpa plasmid incubated with a protein uncoated wild-type 276-mer DNA; 
or piyiCI lacXpa plasmid reacted and D-looped with RecA-coated wild-type 276-mer DNA. The 

10 wild-type 276-mer DNA was heat denatured and either coated or not coated with RecA protein in a 
standard RecA protein coating reaction protocol (Sena and Zarling. supra). Following a 10-min RecA 
coating reaction, the RecA-coated complementary single-stranded 276-mers were incubated at 37X 
for 60 min. with the mutant target plasmid to allow hybrid formation. A 60 min incubation of the mutant 
target plasmid DNA with uncoated complementary single-stranded nonnal wild-type 276-mers was 

1 5 earned out as a control and hybrids were not formed. The B-galactosidase activity in needle 

microinjected cells using the wild-type plasmids is shown for comparison. On average, about 50% of 
the total microinjected cells survived. The numbers of surviving cells scoring blue with the mutant 
plasmid hybridized with RecA-coated CSS DNA and reacted with non-RecA-coated CSS DNA 
samples (3, 4 and 5) were compared with fourfold P^ tests. The frequency of corrected blue cells In 

20 the RecA-coated CSS DNA samples (Sample 5; 6 out of 1 68) is significantly higher than that of either 
Sample 3 or Sample 4. The frequency of con-ected RecA-coated CSS DNA probeitarget hybrids blue 
cells in Sample 5 is significantly higher than that of Sample 4 at the 5% significance level (P^ = 3.76 > 
P^aos)- Th® frequency of connected blue cells in Sample 5 containing RecA-coated CSS DNA 
probe:target hybrids is significantly higher than that of Sample 3 at the 1% significance level (P^ = 6.28 

25 > P^ooi)- When Samples 3 and 4 are combined and compared with Sample 5, the frequency of 
corrected blue cells in Sample 5 is significantly higher thari that of the combined sample at the 0.1% 
signficance level (P2 = 9.99 > P^aooi)- 

Fig. 7A. Southem hybridization analysis of the 687-bp fragment amplified from genomic DNA. 
Electrophoretic migration of a 687-bp DNA fragment generated with primers CF1 and CF6 from 
30 genomic DNA of 3CFTE29o-cells which were capillary needle-microinjected with the 491 -nucleotide 
DNA fragment in the presence of recA protein (lane 2) or transfected as a protein-DNA-lipid complex 
where the 491 -nucleotide fragments were coated with recA protein (+; lane 3). The control DNA was 
amplified from nontransfected 3CFTE29o-cultures (lane 1). 
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Fig. 7B. Autoradiographic analysis of DNA transferred to Gene Screen Plus filters and hybridized with 
a ''P-iabeled oligonucleotide specific for normal exon 10 sequences in the region of the AF508 
mutation. Cells transfected by micro-injection or protein-lipid-DNA complexes both were positive for 
homologous targeting, whereas control cells were not. 

Fig. 8A. Analysis of DNA from cells eleclroporated or transfected with CSS DNA encapsulated in a 
protein-lipid complex. Allele-specific PCR ampBfication of the 687/684 bp DNA fragment amplified in 
the first round with primers CF1 and oligo N (N) or oligo AF (AF). Ethldium bromide-stained 300 bp 
DNA fragment separated by electrophoresis in a 1% agarose gel. The DNA in each lane is as follows: 
lane 1. 100-bp marker DNA; lane 2. control 16HBE14o-cell DNA amplified with the CF1/N primer pair, 
lane 3, nontransfected ECFTE29o-cel! DNA amplified with CF1/N primers; lane 4. nbntransfected 
ECFTE29o-cell DNA amplified with CF1/AF primers; lane 5. DNA from ECFTE29o-cells electroporated 
with recA-coated 491-nucleolide fragments and amplified with CF1/N primers; lane 6, DNA from 
ECFTE29o-cells transfected with recA-coated 491.nucleotide fragment encapsulated in a protein-lipid 
complex and amplified with CF1/N primers. 

Fig. SB. Autoradiographic analysis of the DNA in Fig. IIA transferred to Gene Screen Plus filters and 
hybridized with «P-labeled oligo N probe. Samples in lanes 1-5 for the autoradiographic analysis are 
equivalent to samples in lanes 2-6 In Fig. IIA. 

Fig. 9. PCR analysis of 3CFTE29o-genomic DNA reconstructed with the addition of 2 x 10* copies of 
recA-coated 491-nucleotide CSS DNA fragmente per microgram of genomic DNA. This number of 
CSS DNA fragments represents the total number of DNA copies microinjected into cells and tests 
whether the 491-nucleotide fragment can act as a primer for the 687/684-bp fragment amplification. 
DNA was amplified as described in Fig. 8A. When the second round of amplification was conducted 
with CF1 and oligo N primers (lane 2), the 300-bp DNA band was not detected when allquots of the 
amplification reaction were separated electrophoretically. Amplification of the ECFTE29o/491 bp DNA . 
fragment with the CF1/oligo AF primer pair produced a 299-bp DNA product (lane I). Mariter DNA is in 
lane 3. 

Figure 1 0 depicts the scheme for the recombination assay used in Example 4. 

Fig. 1 1 shows RecA mediated cssDNA targeting to dsDNA with deletions produces a mixed population 
of probertarget hybrids. The biotinylated cssDNA probes were denatured and coated with RecA at 
37'C as described in Material . The reaction mixture was incubated for 60 minutes at 37''C. All 
reactions were stopped by deproteinization with 1.2% SDS and separated by electrophoresis on a 20 
cm X 25 cm 1% agarose gel. The gel was run ovemight at 30V then blotted onto a positively charged 
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Tropilon Plus (TROPIX) membrane. The DNAwas monitored for the presence of unhybridized probe 
or probe:target hybrids using an alkaline phosphatase based chemiluminescent detection of biotin. 
When the membranes were exposed to X-ray film and developed, it is evident that cssDNA probes will 
hybridize to dsDNA targets which are completely homologous, as well as dsDNA targets which contain 

5 a deletion (lanes 3 and 6. respectively). RecA mediated cssDNA targeting to completely honrwiogous 
dsDNA (pRD.O) forms a probe:target hybrid whose electrophoretic mobility is comparable to the 
electrophoretic mobility of completely relaxed Form I DNA. which is similar to the nwbility of Form II 
DNA (lanes 3, 8. and 10), referred to as the rl* hybrid. RecA mediated hybridization of cssDNA to 
dsDNA containing a 59 base pair deletion (pRD.59). a probertarget hybrid that migrates to a position 

10 similar to Form I DNA (lane 6). is referred to as the 1* hybrid. 

Fig. 12 shows data for the enhanced homologous recombination (EHR) of cssDNA probertarget 
hybrids in E. coli. as per Example 4. The homologously targeted probe:target hybrids have enhanced 
homologous recombination frequencies in recombination proficient ceils. cssDNA probeitarget hybrids 
were formed as described in the legend of Figure 11 and were introduced Into RecA+ and RecA-E. coli 

15 as in described Figure 12. The molar ratio of cssDNA probeitarget in the in vitro targeting reaction 
varied from 1:1 to 1:5.6. The % recpmbinant/total colonies is the percentage of blue colonies in the 
total population of ampicillin-resistant colonies. Groups with 0% recombinants did not produce any 
blue colonies in at least 10* plated colonies. Plasmid DNA was isolated from blue colonies that were 
serially propagated for three generations to determine if homologous recombination stably occurred in 

20 the lacZ gene. 

Fig.. 13 shows double D-loop hybrids with internal homology clamps. A) Duplex target DNA (thin line) 
is completely homologous to the cssDNA probe (thick) and each probe strand can pair with its 
complementary strand in the target B) Duplex target has a deletion with respect to the cssDNA probe. 
The deleted region is indicated with a dashed line. The region of the cssDNA probes homologous to 
25 the deleted region In the target can re-pair with each other forming a stable hybrid complex. C) 

Duplex target has an insertion (dashed line) with respect to the cssDNA probe. Structures on the left 
show the re-annealing of cssDNA probe or target strands to form intemal homology clamps. 
Structures on the right show the presence of unpaired regions in comparable single D-loop hybrids. 

Figs. 14A and 14B. Figure 14A depicts the Maps of Plasmids pRD.O and pRD.59. Relative positions 
30 of cssDNA probes IP290 and CP443, PGR primers 1A and 48, restriction endonuclease sites EcoRI, 
Seal, and Dral are indicated. The alpha peptide sequence of the LacZ gene is indicated. Note the 
deletion ()) in pRD.59 is approximately equidistant from the ends of primers 1 A and 48. Figure 148). 
Time course for cssDNA probe:target hybrid formation with linear dsDNA targets. Biotinylated. RecA 
coated cssDNA probe IP290 was hybridized as described to Seal -digested plasmids pRD.O and 
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pRD.59 carrying 0 or 59 bp deletibn. respectively at the EcoR1 site in pRD.O. Probe IP290 is 
completely homologous to pRD.O. but has a 59 bp Insertion with respect to pRD.59. 

Fig. 1 5 depicts the formation of cssDNA probe:target hybrids formed with linear dsDNA targets 
containing snriall deletions. A) Plasmid constructs and probes used in this study. A series of plasmids 
5 with defined deletions were constructed from the EcoRI site of pRD.O (pbluescriptllSK+ (Stratagene) 
as described in Example 5. Each plasmid is named for the size of the deletion, as indicated on the 
left. A series of cssDNA probes were labelled and constructed by PGR from various primers which 
flank the deleted region. Probes were made from either pRD.O or the deleted plasmids and named for 
the size of tlie probe when made from pRD.O (2960 bp). For example. p527 is 527 bp long. When the 

10 cssDNA probes are produced from pRD.O and targeted to plasmids containing deletions, the probe is 
called IP527 to indicate that the insertion probe (IP) has an insertion with respect to the target When 
the probe is made from one of the targets with a deletion and then, targeted to pRD.O, the probe is 
caHed DP527 to indicate that the deletion probe (DP) has a deletion with respect to pRD.O. Control 
probe CP443 is made from a region of pRD.O that does not contain any insertions or deletions. The 

15 limits of the deleted regions in the plasmid DNA target are indicated by dashed lines and the size ymits 
ofcssDNA probes are indicated by sond lines. B) BioHnylated cssDNA probes IP527. IP407. and 
CP443 were coated with RecA protein and hybridized at 37'C to a series of linear duplex DNA 
targets containing deletions ranging in size from 0 to 447 bp. The products of the targeting reaction 
were deproteinized and separated on a 1 % TAE-agarose gel and then transferred to nylon 
20 membranes as described in Example 5. Biotinylated DNA was detected with a chemiluminescent 
substrate as described. The extent of hybrid product formation of Form III DNA targets was 
determined by densitometry of the autoradiographs. The relative amount of hybrid fomied between 
RecA coated cssDNA probes IP527 and IP407 is shown in (B). Error bars are indicated. The amount 
of probe:target hybrids formed with each target DNA was nomialized by the amount of probertarget 
25 hybrids fanned with control probe CP443 which hybridizes to the target located in a region which is a 
significant distance away from the deletion site. Examples of the cssDNA probe:taiget hybrid formed 
with linear targets are shown in the autoradiogram (C). In Fig. 16(D) the difference in the percent 
hybrid formation between cssDNA probes IP527 and IP407 are plotted from the data shown in (B). 

Fig. 16 depicts that insertions and deletions have the same effect on the relative efficiency of 
30 probertarget hybrid fonnation. RecA-coated cssDNA probes IP215 made from pRD^O was targeted to 
Seal-digests of plasmids pRD.O, pRD.8. pRD.25, and pRD.59 and compared to similar reactions of 
DP215 cssDNA probes made from pRD.O. pRD.8. pRD.25. and pRD.59 and targeted to pRD.O. The 
effect of insertions in the cssDNA probe (dartc line) is compared with deletions in the cssDNA probe 
(shaded line) of the same size. The relative level of hybrid fomiation for each cssDNA probe with a 
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heterologous target is normalized by the level of hybridization with the homologous target, 
respectively. The data represents an average of three experiments. Error bars are indicated. 

Figs 1 7A, 1 7B and 17C. Figure 17A depict the fomnation of stable double-D-Loop hybrids in linear 
dsDNA targets containing large deletions. Biotinylated cssDNA probe IP1246 was coated with RecA 

5 protein and targeted to Seal digests of the indicated plasmids as described herein. The relative 

amount of hybrid formation formed between RecA-coated cssDNA probes and plasmids with deletions 
ranging from 0-967 bp was normalized to the amount of probe:target hybrids fomrted with control probe 
CP443. Autoradiograph (17A) shows the bioBnylated cssDNA probes or probe:target hybrids. The 
position of the untargeted Seal-digested (Formlll) marker for each of the plasmids are indicated on the 

1 0 right. The relative level of hybrid formation (B) of each of the bands in (A) was normalized to the level 
of hybrid formation with control cssDNA probe CP443. as described herein. The relative position of the 
cssDNA probes with respect to the position of the deletion in the target DNA is shown in (C), 

Figs. 18A, 18B. 18C and 18D depict the formation of restriction endonuclease sites in probeitarget 
hybrids. The probeitarget hybrids formed between probe IP290 and pRD.O and pRD.59 targets were 

1 5 deproteinized by extraction with chlorofbrm:phenol:isoamyl alcohol and chlorofonn. Restriction 

enzyme treated DNA samples were incubated with EcoRI for three hours before separation on a 1% 
agarose gel and transferred onto a nylon membrane. The ethidium bromide stained DNA of the 
products of the targeting reactions formed between cssDNA probe IP290 and circular plasmid targets 
pRD.O or pRD:69 (A and B) and autoradiographs showing the positions of biotinylated cssDNA 

20 probe:target hybrids (C and D) are shown. The positions of form I and form III mari<ers of pRD.O are 
shown on the right The positions of the pRD59 hybrids 1* (form I) and rl* (relaxed) are shown on the 
left 

Fig. 19 depicts the thermal stability of relaxed and non-relaxed probeitarget hybrids. The RecA 
mediated cssDNA targeting reaction was performed with the cssDNA probe IP290 and the dsDNA 
25 target pRD.59, as described herein. The probeitarget hybrids were deproteinized with 1 .2% SDS and 
then incubated for 5 minutes at the indicated temperatures. The themially melted products were then 
separated on a 1% agarose gel and blotted onto a positively charged Tropilon membrane. 
Autoradiograph shows the position of biotinylated cssDNA probeitarget hybrids i* (fbrmi) and rl* 
(relaxed) as shown on the left. 

30 Figs. 20A and 20B. The organization of the mouse OTC gene. Sequence of cssDNA probes and 
PGR primers used in this study are indicated. Sizes of the exons in base pairs are indicated. The 
relative position of PGR primers M9. iyi8 and M1 1 are shown. B) Map of plasmid pTAOTGI . A 250 bp 
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fragment containing the normal OTC exon 4 sequence and sun-ounding introns were cloned into the 
EcoRV site of pbluescrlpt SK (+) (Stratagene). 

Fig. 21 . Sequence analysis of exon 4 of the mouse OTC gene In founder mice. PGR amplification of 
genomic DNA from tail biopsies of a pool of all of the homozygous (spf-ash/spf-ash) females used as 
egg donors and each indicated individual founder mice were sequenced using cycle sequencing with 
the 1^1 1 primer (Cyclist kit. Stratagene). The DNA sequence sunounding the spf-ash locus (anrow) in 
the OTC gene is shown. 

Fig. 22. Germline transmission of OTC+ aliele corrected by EHR. The inheritance patterns of the 
OTC alleles are depicted. Legend indicates the genotype and/or phenotype of the FO, F1, and F2 
mice produced from microinjected zygotes obtained from the cross of homozygous (spf-ash/spf-ash) 
mutant females and normal males (top). The genotype of FO and F1 animals were determined by 
DNA sequencing and the typing of F2 animals as deduced by phenotype. Control cross A of 
(hemizygous spf-ash/Y) mutant FO male with normal (+/+) females and control cross B of 
heterozygous (spf-ash/+) F1 females with a normal male are indicated. The number below the boxes 
or circles indicate the total number of mice of each type produced from each cross. Total numbers of 
mice counted are representative of 2-4 litters. Mouse #213 and #1014 (noted by arrow) are F1 
animals that carry a germiine transmitted gene corrected allele from mosaic HR gene corrected male 
mouse #16. 

Fig. 23. Germline transmission of corrected allele of FO male #16. Pictures of F1 progeny from the 
cross of mouse #16 with homozygous (spf-ash/spf-ash) females (top). This cross produced several 
pups with spf-ash mutant phenotypes (middle) and one F1 pup (#1014) with a normal phenotype. 
Three views of mouse #1014 are shown (bottom). All of the F1 animals were two weeks old at the 
time of photography. 

DEFINITIONS 

Unless defined othenvise, all technical and scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art to which this invention belongs. Although any 
methods and materials similar or equivalent to those described herein can be used in the practice or 
testing of the present invention, the preferred methods and materials are described. For purposes of 
the present invention, the following terms are defined below. 

As used herein, the twenty conventional amino acids and their abbreviations follow conventional usage 
(ImmMnPlpgy -A $VPth9?|g, 2nd Edition, E.S. Golub and D.R. Gr^en, Eds.. Sinauer Associates. 
Sunderland, Miassachusetts (1 991 ). which is incorporated herein by reference). 
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By Anucleic acid@. Aoiigonucleotide®, and Apolynucleotide@ or grammatical equivalents herein 
means at least two nucleotides covalently linked together A nucleic acid of the present invention will 
generally contain phosphodlester bonds, although in some cases nucleic acid analogs are included 
that may have altemate backbones, comprising, for example, phosphoramide (Beaucage et al.. 
5 Tetrahedron 49{10):1925 (1993) and references therein; Letsinger. J. Org. Chem. 35:3800 (1970); 
Sprinzl et aL. Eur. J. Biochem. 81:579 (1977); Letsinger et al.. Nucl. Acids Res. 14:3487 (1986); Sawai 
et al. Chem. Lett. 805 (1984), Letsinger et al., J. Am. Chem. See. 1 10:4470 (1988); and Pauwels et a!.. 
Chemica Scripta 26:141 91986)). phosphorothioate. phosphorodithioate. O-methyiphophbroamidite 
linkages (see Eckstein. Oligonucjeotides and Analogues: A Practical Approach. Oxford University 

10 Press), and peptide nucleic ackJ backbones and linkages (see Egholm. J. Am. Chem. Soc. 1 14:1895 
(1992); Meier et a!.. Chem. Int. Ed. EngL 31:1008 (1992); Nielsen, Nature. 365:566 (1993); Carlsson et 
al.. Nature 380:207 (1 996). all of which are incorporated by reiference). These modificattons of the 
ribose-phosphate backbone or bases may be done to facilitate the addition of other moleUes such as 
chemical constituents, including 2' O-methyl and 5' modified substituents, as discussed below, or to 

1 5 increase the stability and half-life of such molecules in physiological environments. 

The nucleic acids may be single stranded or double stranded, as specified, or contain portions of both 
double stranded or single stranded sequence. The nucleic acid may be DNA. both genomic and 
cDNA. RNA or a hybrid, where the nucleic acid contains any combination of deoxyribo-and ribo- 
nucleotides, and any combination of bases, including uracil, adenine, thymine, cytosine. guanine, 
20 inosine, xathanine and hypoxathanine, etc. Thus, for example, chimeric DNA-RNA molecules may be 
used such as described in Cole-Strauss et al., Science 273:1386 (1996) and Yoon et al., PNAS USA 
93:2071 (1 996), both of which are hereby incorporated by reference. 

* In general, the targeting polynucleotides may comprise any number of structures, as long as the 
changes do not substantially effect the functional ability of the targeting polynucleotide to result in 
25 homologous recombination. For example, recombinase coating of alternate structures should still be 
able to occur. 

fis used herein, the terms Apredetermined endogenous DNA sequence" and "predetenmlned target 
sequence" refer to polynucleotide sequences contained In a target cell. Such sequences Include, for 
example, chromosomal sequences (e.g.. structural genes, regulatory sequences including promoters 
30 and enhancers, recombinatorial hotspots, repeat sequences, integrated proviral sequences, hairpins, 
palindromes), episomal or extrachromosomal sequences (e.g.. replicable plasmids or viral or parasitic 
replication intermediates) including chloroplast and mitochondrial DNA sequences. By 
"predetermined" or Apre-selected® it is meant that the target sequence may be selected at the 
discretion of the practitioner on the basis of known or predicted sequence information, and is not 
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constrained to specific sites recognized by certain site-specific recombinases (e.g.. FLP recombinase 
or CRE recombinase). In some embodiments, the predetermined endogenous DNA target sequence 
will be other than a naturally occumng gemiline DNA sequence (e.g., a transgene. parasitic, 
mycoplasmal or viral sequence). An exogenous polynucleotide is a polynucleotide which is 
transfenred info a target cell but which has not been replicated in that host cell; for example, a virus 
genome polynucleotide that enters a cell by fusion of a virion to the ceH is an exogenous 
polynucleotide, however, replicated copies of the viral polynucleotide subsequently made In the 
infected cell are endogenous sequences (and may. for example, become integrated Into a cell 
chromosome). Similarly, transgenes which are microinjected or b^nsfected into a cell are exogenous 
polynucleotides, however integrated and replicated copies of the transgene{s) are endogenous 
sequences. 

The term "corresponds to" is used herein to mean that a polynucleotide sequence is homologous (i.e.. 
may be similar or identical, not strictly evolutionariiy related) to all or a portion of a reference 
polynucleotide sequence, or that a polypeptide sequence is identical to a reference polypeptide 
sequence. In contradistinction, the temi "complementary to" is used herein to mean that the 
complementary sequence is homologous to all or a portion of a reference polynucleotide sequence. 
As outlined below, preferably, the homology Is at least 50-70%. preferably 85%, and more preferably 
95% identical. Thus, the complementarity between two single-stranded targeting polynucleotides need 
not be perfect. For illustration, the nucleotide sequence "TATAC" conesponds to a reference 
sequence "TATAC@ and is perfectly complementary to a reference sequence "GTATA". 

The temis "substantially corresponds to" or "substantial identity" or Ahomologous® as used herein 
denotes a characteristic of a nucleic acid sequence, wherein a nucleic acid sequence has at least 
about 60 percent sequence identity as compared to a reference sequence, typically at least about 75 
percent sequence Identity, and preferably at least about 95 percent sequence Identity as compared to 
a reference sequence. The percentage of sequence identity is calculated excluding small deletions or 
additions which total less than 25 percent ofthe reference sequence. The reference sequence may 
be a subset of a larger sequence, such as a portion of a gene or flanking sequence, or a repetitive 
portion of a chromosome. However, the reference sequence is at least 12-18 nucleotides lorig, 
typically at least about 30 nucleotides long, and preferably at least about 50 to 100 nucleotides long. 
ASubstantially complementary" as used herein refers to a sequence that is complementary to a 
sequence that substantially corresponds to a reference sequence. In general, targeting efficiency 
increases with the length of the targeting polynucleotide portion that is substanUally complementary to 
a reference sequence present in the target DNA. 
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"Specific hybridization® is defined herein as the formation of hybrids between a targeting 
polynucleotide (e.g., a polynucleotide of the invention which may include substitutions, deletion, and/or 
additions as compared to the predetermined target DNA sequence) and a predetermined target DNA. 
wherein the targeting polynucleotide preferentially hybridizes to the predetermined target DNA such 

5 that, for example, at least one discrete band can be identified on a Southern blot of DNA prepared 
from target cells that contain the target DNA sequence, and/or a targeting polynucleotide in an intact 
nucleus Jocalizes to a discrete chromosomal location characteristic of a unique or repetitive sequence. 
In some instances, a target sequence may be present in more than one target polynucleotide species 
(e.g.. a particular target sequence may occur in multiple members of a gene family or in a known 

1 0 repetitive sequence). It is evident that optimal hybridization conditions will vary depending upon the 
sequence composition and length(s) of the targeting polynucleotide(s) and target{s), and the 
experimental method selected by the practitioner. Various guidelines may be used to select 
appropriate hybridization condiUons (see. Maniatis et al.. Molecular Cionina: A Laboratory Mgnugl 
(1989), 2nd Ed.. Cold Spring Harbor. N.Y. and Berger and Kimmel, Methods in Enzymploqy. VolMrpe 

15 i f^Mlde to Moiecular Clonino Techniques (1987). Academic Press. Inc., San Diego, CA.. which are 
incorporated herein by reference. Methods for hybridizing a targeting polynucleotide to a discrete 
chromosomal location in intact nuclei are provided herein in the Detailed Description. 

The terrn "naturally-occurring® as used herein as applied to an object refers to the f^ct that an object 
can be found In nature. For example, a polynucleotide sequence that is present in an organism 
20 (including viruses) that can be isolated from a source in nature and which has not been intentionally 
modified by man In the laboratory is naturally-occumng. 

A metabolically-active cell is a cell, comprising an intact nucleoid or nucleus, which, when provided 
nutrients and incubated in an appropriate medium canies out DNA synthesis and RNA for extended 
periods (e.g.. at least 12-24 hours). Such metabolically-active cells are typically undifferentiated or 
25 differentiated cells capable or Incapable of further cell division (although non-dividing cells many 
undergo nuclear division and chromosomal replication), although stem cells and progenitor cells are 
also metabolicailly-active cells. 

As used herein, the term "disease allele© refers to ah allele of a gene which is capable of producing a 
recognizable disease. A disease allele may be dominant or recessive and tnay produce disease 
30 directly or when present in combination with a specific genetic background or pre-existing pathological 
condition. A disease allele may be present in the gene pool or may be generated de novo in an 
individual by somatic mutation. For example and not limitation, disease to alleles include: agtivated 
oncogenes, a sickle cell anemia allele, a Tay-Sachs allele, a cystic fibrosis allele, a Lesch-Nyhan 
allele, a retinoblastoma-susceptibility allele, a Fabry's disease allele, and a Huntington's chorea allele. 
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As used herein, a disease allele encompasses both alleles associated with human diseases and 
alleles associated with recognized veterinary diseases. For example, the AF508 CFTR allele in a 
human disease allele which is associated with cystic fibrosis in North Americans. 

As used herein, the term "cell-uptake component® refers to an agent which, when bound, either 
directly or indirectly, to a targeting polynucleotide, enhances the intracellular uptake of the targeting 
polynucleotide into at least one cell type (e.g.. hepatocytes). A cell-uptake component may include, 
but is not limited to. the following: specific cell surface receptors such as a galactose-terminal (asialo-) 
glycoprotein capable of being internafeed into hepatocytes via a hepatocyte asialoglycoprotein 
receptor, a polycation (e.g.. poly-L-lysine). and/or a protein-lipid complex formed with the targeting 
polynucleotide. Various combinations of the above, as well as alternative cell-uptake components will 
be apparent to those of skill in the art and are provided in the published literature. 

DETAILED Pg SCRiPTIQN 

Generally, the nomenclature used hereafter and the laboratory procedures in cell culture, molecular 
genetics, and nucleic add chemistry and hybridization described befow are those well known and 
commonly employed in the art. Standard techniques are used for recombinant nudefe acid methods, 
polynucleotide synthesis, cell culture, and transgenesis. Generally enzymatic reacttons. 
oligonucleotide synthesis, oligonucleotide modification, and purification steps are perfomied according 
to ttie manufacturer's specifications. The techniques and procedures are generally performed 
according to conventional methods in the art and various general references which are provided 
throughout mis document. The procedures therein are believed to be well known in the art and are 
provWed for the convenience of the reader. All the information contained therein is incorporated 
herein by reference. 

Transgenic mice are derived according to Hogan. et al.. "Manipulating the Mouse Embryo: A 
Laboratory Manual®. Cold Spring Harbor Laboratory (1988) whfch is incorporated herein by 
reference. 

Embryonic stem cells are manipulated according to published procedures (Teratocareinomas and 
embryonic stem cells: a practical approach, E.J. Robertson, ed., IRL Press, Washington. D.C., 1987; 
zpistra et al.. Ngfaire 342:435-438 (1 989); and Schwartzberg et al.. Science 24g:799-803 (1989), each 
of whk:h Is incorporated herein by reference). 
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Zygotes are manipulated according to known procedures; for example see U.S. Patent No. 4,873,191. 
Brinster et al., PNAS 86:7007 (1989); Susulic et al., J. Biol. Chem. 49:29483 (1995). and Cavard et a!.. 
Nucleic Acids Res. 16:2099 (1988). hereby incorporated by reference. 

Oligonucleotides can be synthesized on an Applied Bio Systems oligonucleotide synthesizer according 
5 to specifications provided by the manufecturer. Modified oligonucleotides and peptide nucleic acids 
are made as Is generally known in the art 

The present invention provkles methods for targeting and altering, by homologous recombination, a 
pre-selected target nucleic acid sequence in a target cell, to make targeted sequence modifications. 
The methods comprise introducing into the target cells a recombinase and at least two single-stranded 
1 0 targeting polynucleotides whfch are substantially complementary to each other. The targeting 
polynucleotides each comprise at least one homology clamp that substantially corresponds to or is 
substantially complementary to the preselected target nucleic acid sequence. The target cells are 
then screened to identify target cells containing the targeted sequence modification. 

Taraetina Po lynucleotides 

1 5 Targeting polynucleotides may be produced by chemical synthesis of oligonucfeotides, nick-translation 
of a double-stranded DNA template, polymerase chain-reactton amplification of a sequence (or ligase 
chain reaction amplification), purifteation of prokaiyotic or target cloning vectors harboring a sequence 
of Interest (e.g.. a cloned cDNA or genomic ctone. or portion thereof such as plasmids. phagemids, 
YACs, cosmids, bacteriophage DNA, other viral DMA or replication intermediates, or purified restriction 

20 fragments thereof, as well as other sources of single and double-stranded polynucleotides having a 
desired nucleotide sequence. Targeting polynucleotides are generally ssDNA or dsDNA, most 
preferably two complementary single^stranded DNAs. 

Targeting polynucleotides are generally at least about 2 to 100 nucleotides long, preferably at least 
about 6-to 100 nucleotides long, at least about 250 to 500 nucleotides long, more preferably at least 

25 about 500 to 2000 nucleotides long, or longer however, as the length of a targeting polynucleotide 
increases beyond about 20.000 to 50,000 to 400.000 nucleotides, the efficiency or transfening an 
Intact targeting polynucleotide into the cell decreases. The length of homology may t?e selected at the 
discretion of the practitioner on the basis of the sequence composition and complexity of the 
predetermined endogenous target DNA sequence(s) and guidance provided in the art. which generally 

30 indicates that 1.3 to 6.8 kilobase segments of homology are prefen^ed (Hasty et al. (1991) Moiec. Cell , 
Biol. 11: 5586; Shulman etal. (1990) Molec. Cell. Biol. 10: 4466, which are incorporated herein by 
reference). Targeting polynucleotides have at least one sequence that substantially corresponds to. or 
is substantially complementary to. a predetermined endogenous DNA sequence.(i.e.. a DNA 
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sequence of a polynucleotide located in a target cell, such as a chromosomal, mitochondrial, 
chtoroplast. viral, episomal. or mycoplasmal polynucleotide). Such targeting polynucleotide 
sequences serve as templates for homologous pairing with the predetermined endogenous 
sequence(s). and are also referred to herein as homology clamps. In targeting polynucleotides, such 
homology clamps are typically located at or near the 6' or 3' end. preferably homology clamps are 
internally or located at each end of the polynucleotide (Berinstein etal. (1992) Molec. Call Rir;>i , ip- 
360. which Is incorporated herein by reference). Without wishing to be bound by any particular theory, 
it is believed that the addition of recombinases permits efficient gene targeting with targeting 
polynucleotides having short (i.e.. about 50 to 1000 basepair long) segments of homology, as well as 
with targeting polynucleotides having longer segments of homology. 

Therefore, it is preferred that targeting polynucleotides of the invention have homology clamps that are 
highly homologous to the predetermined target endogenous DNA sequence{s). most preferably 
isogenic. Typically, targeting polynucleotides of the invention have at least one homology clamp that 
is at least about 18 to 35 nucleotides long, and It is preferable that homology clamps are at least about 
20 to iOO nucleotides long, and more preferably at least about 100-500 nucleotides long, although the 
degree of sequence homology between the homology clamp and the targeted sequence and the base 
composition of the targeted sequence will detemnine the optima! and minimal clamp lengths (e.g.. G-C 
rich sequences are typically more thermodynamically stable and will generally require shorter damp 
length). Therefore, both homology clamp length and the degree of sequence homology can only be 
detemiined with reference to a particular predetermined sequence, but homology clamps generally 
must be at least about 12 nucleotides long and must also substantially correspond or be substantially 
complementary to a predetemiined target sequence. Preferably, a homology clamp is at least about 
12. and preferably at least about 50 nucleotides long and is identical to or complementary to a 
predetemiined target sequence. Without wishing to be bound by a particular theory, it is believed that 
the addition of recombinases to a targeting polynucleotide enhances the efficiency of homologous 
recombination between homologous, nonisogenic sequences (e.g.. between an exon 2 sequence of a 
albumin gene of a Balb/c mouse and a homologous albumjn gene exon 2 sequence of a C57/BL6 
mouse), as well as between Isogenic sequences. 



30 



The fomiation of heteroduplex joints is not a stringent process; genetic evidence supports the view 
that the classical phenomena of meiotic gene conversion and aberrant meiotic segregation result in 
part from the inclusion of mismatched base pairs in heteroduplex joints, and the subsequent correction 
of some of these mismatched base pairs before replication. Observations on recA protein have 
provided information on parameters that affect the discrimination of relatedness from perfect or near- 
periect homology and that affect the inclusion of mismatched base pairs in heteroduplex joints. The 
35 ability of recA protein to drive strand exchange past all single base-pair mismatches and to form 
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extensively mismatched joints in supertielical DNA reflect its role in recombination and gene 
conversion. This error-prone process may also be related to its role in mutagenesis. RecA-mediated 
pairing reactions involving DNA of NX174 and G4. which are about 70 percent homologous, have 
yielded homologous recombinants (Cunningham etal. (1981) £fiU24: 213). although recA 

5 preferentially forms homologous joints between highly homologous sequences, and is implicated as 
mediating a homology search process between an invading DNA strand and a recipient DNA strand, 
producing relatively stable heteroduplexes at regions of high homology. Accordingly, it is the fact that 
recombinases can drive the homologous recombination reaction between strands which are 
sgnificantly, but not perfectly, homologous, which allows gene conversion and the modification of 

10 target sequences. Thus, targeting polynucleotides may be used to introduce nucleotide substitutions, 
insertions and deletions into an endogeneous DNA sequence, and thus the corresponding amino acid 
substitutions, insertions and deletions in proteins expressed from the endogeneous DNA sequence. 

In a preferred embodiment, two substanBally complementary targeting polynucleotides are used. In 
one embodiment, the targeting polynucleotides form a double stranded hybrid, which may be coated 
1 5 with recombinase. although when the recombinase is recA. the loading condifions may be somewhat 
different from those used for single stranded nucleic adds. 

In a prefered embodiment, two substantially complenrwntary single-stranded targeting polynucleotides 
are used. The two complenrentary single-stranded targeting polynucleotides are usually of equal 
length, although this Is not required. However, as noted below, the stability of the four strand hybrids 

20 of the invention is putatively related, in part, to the lack of significant unhybridized single-stranded 

nucleic acid, and thus significant unpaired sequences are not prefen-ed. Furthermore, as noted above, 
the complementarity between the two targeting polynucleotides need not be perfect. The two 
complementary single-stranded targeting polynucleotides are simultaneously or contemporaneously 
introduced into a target cell harboring a predetermined endogenous target sequence, generally with at 

25 lease one recombinase protein (e.g.. recA). Under most circumstances, it is preferred that the 
targeting polynucleotides are incubated with recA or other recombinase prior to Introduction into a 
target cell, so that the recombinase protein(s) may be "loaded" onto the targeting polynucleotide(s). to 
coat the nucleic acid, as is described below. Incubation conditions for such reoonnbinase loading are 
described Infra, and also in U.S.S.N. 07/755,462, filed 4 September 1991; U.S.S.N. 07/910.791. filed 9 

30 July 1992; and U.S.S.N. 07/520.321, filed 7 May 1990. each of wrtiich is Incorporated herein by 

reference. A targeting polynucleotide may contain a sequence that enhances the loading process of a 
recombinase. for example a recA loading sequence is the recombinogenic and recombinase 
nucleation sequence polytd(A-C)J and its complement, poly(d(G-T)J. The duplex sequence oligold(A- 
C)„ •d{G-T)J. where n is from 4 to 35, is a middle repetitive element in target DNA. 
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There appears to be a fundamental difference in the stability of RecA-protein-mediated D-loops formed 
between one single-stranded DNA (ssDNA) probe hybridized to negatively supercoiled DNA targets in 
comparison to relaxed or linear duplex DNA targets. Internally located dsDNA target sequences on 
relaxed linear DNA targets hybridized by ssDNA probes produce single D-loops. which are unstable 
5 after removal of RecA protein (Adzuma. Genes Devel. 6:1679 (1992); Hsieh et al. PNAS USA 89:6492 
(1992);Chiuetal., Biochemistry 32:13146(1993)). This probe DNA instability of hybrids formed with 
linear duplex DNA targets is most probably due to the incoming ssDNA probe WC base pairing with 
the complementary DNA strand of the duplex target and disrupUng the base pairing in the other DNA 
strand. The required high free-energy of maintaining a disrupted DNA strand in an unpaired ssDNA 
10 confomiation in a protein-free sIngle-D-loop apparently can only be compensated for either by the 
stored free energy Inherent in negatively supercoiled DNA targets or by base pairing initiated at the 
distal ends of the joint DNA molecule, allowing the exchanged strands to freely intertwine. 

However, the addition of a second complementary ssDNA to the three^nd-containing single-D-loop 
stabilizes the deproteinized hybrid joint molecules by allowing W-C base pairing of the probe with the 
1 5 displaced target DNA strand. The addition of a second RecA-coated complementary ssDNA 
(cssDNA) strand to the three-strand containing single D-loop stabilizes deproteinized hybrid joints 
located away from the free ends of the duplex target DNA (Sena & Zariing, Nature Genetics 3:365 
(1993); R6vet et al. J. Mol. Biol. 232:779 (1993); Jayasena and Johnston. J. Mol. Bio. 230:1015 
(1993)). The resulting four-stranded structure, named a double D-loop by analogy with the three- 
stranded single D-loop hybrid has been shown to be stable in the absence of RecA protein. This 
stability likely occurs because the restoration of VW; basepairing in the parental duplex would require 
disruption of two W-C basepairs in the douWe-D-loop (one VW: pair in each heteioduplex D-loop). 
Since each base-pairing in the reverse transition (double-D-loop to duplex) is less favorable by the 
energy of one W-C basepair. the pair of cssDNA probes are thus kinetically trapped in duplex DNA 
; targets in stable hybrid structures. The stability of the double-D loop joint molecule within internally 
located probe:target hybrids is an intermediate stage prior to the progression of the homologous 
recombination reaction to the strand exchange phase. The dpuble D-loop permits isolation of stable 
multistranded DNA recombination intemiediates. 

In addition, when the targeting polynucleotides are used to generate insertions or deletions in an 
endogeneous nucleic acid sequence, the use of two complementary single-strande'd targeting 
polynucleotides allows the use of internal homology clamps as depicted in Figure 13. The use of 
internal homology clamps allows the fbnnation of stable deproteinized cssDNAiprobe target hybrids 
with homologous DNA sequences containing either relatively small or large insertions and deletions 
within a homologous DNA target. Without being bound by theory, it appears that these probe:target 
hybrids, with heterologous inserts in the cssDNA probe, are stabilized by the re-annealing of cssDNA 
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probes to each other within the double-D-loop hybrid, forming a novel DNA structure with an internal 
homology clamp. Similarly stable double-D-loop hybrids fanned at intemal sites with heterologous 
inserts in the linear DNA targets (with respect to the cssDNA probe) are equally stable: Because 
cssDNA probes are kinetically trapped within the duplex target, the multi-stranded DNA intermediates 
5 of homologous DNA pairing are stabilized and strand exchange is facilitated. 

In a preferred embodiment, the length of the intemal homology clamp <i.e. the length of the insertion or 
deletion) is from about 1 to 50% of the total length of the targeting polynucleotide, with from about 1 to 
about 20% being preferred and from about 1 to aboul 10% being especially preferred, although in 
some cases the length of the deletion or insertion may be significantly larger. As for the targeting 
10 homology clamps, the complementarity within the intemal homology clamp need not be perfect 

The invention may also be practiced with individual targeting polynucleotides which, do not comprise 
part of a complementary pair. In each case, a targeting polynucleotide is introduced into a target cell 
simultaneously or contemporaneously with a recombinase protein, typically in the fomi of a 
recombinase coated targeting polynucleotide as outlined herein (i.e., a polynucleotide pre-incubated 
15 with recombinase wherein the recombinase is noncovalently bound to the polynucleotide; generally 
referred to In the art as a nucleoprotein filament). 

A targeting polynucleotide used in a method of the invention typically is a single-stranded nucleic acid, 
usually a DNA strand, or derived by denaturation of a duplex DNA, which is complementary to one (or 
both) strand(s) of the target duplex nucleic acid. Thus, one of the complementary single stranded 

20 targeting polynucleotides is complementary to one strand of the endogeneous target sequence (i.e. 
Watson) and the other complementary single stranded targeting polynucleotide is complementary to 
the other strand of the endogeneous target sequence (I.e. Crick). The homology clamp sequence 
preferably contains at least 90-95% sequence homology with tt^e target sequence, to insure 
sequence-specific targeting of the targeting polynucleotide to the endogenous DNA target. Each 

25 single-stranded targeting polynucleotide is typically about 50-600 bases long, although a shorter or 
^ longer polynucleotide may also be employed. Altematively. targeting polynucleotides nnay be 
prepared in single-stranded form by oligonucleotide synttiesis methods, which may first require, 
especially with larger targeting polynucleotides, fonnation of subfragments of tiie targeting 
polynucleotide, typically followed by splicing of the subfragments together, typically by enzymatic 

30 ligation. 

Rw?mbin9gg Prpteing 
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Recombinases are proteins that, when included with an exogenous targeting polynucleotide, provide a 
measurable Increase in the recombination frequency and/or localization frequency between the 
targeting polynucleotide and an endogenous predetermined DMA sequence. Thus, in a preferred 
embodiment, increases in recombination frequency from the normal range of 10-« to lO-". to 10"^ to 10\ 
5 preferably 10^ to 10'. and most preferably 10* to 10', may be acheived. 

In the present invention, recombinase refers to a family of RecA^ike recombination proteins all having 
essentially all or most of the same functions, particularly: (i) the recombinase protein's ability to 
properly bind to and position targeting polynucleotides on their homologous targets and (ii) the ability 
of recombinase protein/targeting polynucleotide complexes to efiicienOy find and bind to 
complementary endogenous sequences. The best characterized recA protein Is from £ cdi. in 
addition to the wild-type protein a number of mutant recA-like proteins have been identified (e.g.. 
recA803; see Madiraju et a!.. PNAS USA 85{18):6592 (1988); Madiraju et al. Biochem. 31:10529 
(1992); Lavery et al.. J. Bid. Chem. 267:20648 (1992)). Further, many organisms have recA-like 
recombinases with strand-transfer activities (e.g., Fugisawa et al.. (1985) Nucl. Acids Rp^ 7473. 
Hsiehetal..(1986)CMM: 885: Hsiehetal.. (1989) J,JiaL£t^ ' 
PrW-N9tl.Acatl.Sq- , {\m fiS: 3683; Cassuto et al.. (1987) Mol. Gen fipnpt 99a- 10; Ganea et al.. 
(1987) Mol. Cell Biol. Z: 3124; Moore etal.. (1990)ABiQL£t]snLJS: 11108; Keeneetal., (1984)i!!iffiL 
AddsBes. 12: 3057; Kimeic. (1984) Cold Sorinn H^rhnrSymp ^p- 675; Kmeic. (1986) CgH^: 545; 
Kolodneretal.. (1987) Prog, Natl, Aca^< Scj I ISAM: 5560; Suginoetal., (1985) Proc. Natl Ar^ri SrJ 
20 im 55: 3683; Halbrook et al.. (1989) QisLChepL 264: 21403; Eisen et al.. (1988) Prt>c. Natl Ar^- 
SeLUSA85: 7481; McCarthy eta!.. (1988) Proc. Natl Arpri ..^n M.ci/^B^- 5354; Lowenhaupt et al.. 
(1989) J, Biol. Chem, 264: 20568. which are incorporated herein by reference. Examples of such 
recombinase proteins include, for example but not limitation: recA. recA803. uvsX, and other recA 
mutants and recA-like recombinases (Roca, A. I. (1990) Crit Rev. Rinrhpn, hy| » |oo p p, ^ 4^5) 
25 ^(Kotodneretal. (1987) prpg. Natl- ACPrt SPI fl > S A ) M:5560: Tishkoff et al. Molec. CpII Rini 
11:2593), RuvC (Dunderdale et al. (1991) liatuiB 2Sd: 506). DST2. KEM1, XRN1 (Dykstra et al. 

(1991) Molgc.Cgl| , Rio(, 11:2583). STP"/DST1 (Clarketal. (1991) MslSSL£slLBi2L 11:2576). HPP-1 
(Moore et al. (1991) Prop, Ngtl, Aff^d SqI (lis A) aS:9067). other target lecombinases (Bishop et al. 

(1992) eel! g2: 439; Shinohara et al. (1992) CeH 69: 457); incorporated herein by reference. RecA 
30 may be purified from E. co// strains, such as E. co// strains JC12772 and JC15369 (available from A. J. 

Clark and M. Madiraju. University of California-Berkeley, or purchased commercially). These strains 
contain the recA coding sequences on a -mnaway" replicating plasmid vector present at a high copy 
nun*ers per cell. The recA803 protein is a high^ctivity mutant of wild-type recA. The art teaches 
severalexamples Of recombinase proteins, for example, from Drosophila. yeast, plant, human, and 
non-human mammalian cells, including proteins with biotogk:al properties similar to recA (i.e., recA-like 
recombinases), such as RadSI from mammals and yeast, and Pkwec (see RashM et al.. Nucleic AckJ 
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Res. 25(4):719 (1997), hereby incorporated by reference). In addition, the recombinase may actually 
be a complex of proteins, i.e. a Arecombinosome®. In addition, included within the definition of a 
recombinase are portions or fragments of recombinases which retain recombinase biological activity, 
as well as variants or mutants of wild-type recombinases which retain biological activity, such as the E. 
coli recA803 mutant with enhanced recoinbinase activity. 

In a preferred embodiment. recA or radSI is used. For example. recA protein is typically obtained 
from bacterial strains that overproduce the protein: wild-type £. co/i recA protein and mutant recA803 
protein may be purified from such strains. Alternatively. recA protein can also be purchased flPom. for 
example, Pharmacia (Piscataway. NJ). 

RecA proteins, and Its homologs. form a nudeoprotein filament when it coats a single-stranded DNA. 
In this nudeoprotein filament, one monomer of recA protein is bound to about 3 nucleotides. This 
property of recA to coat single-stranded DNA is essentially sequence independent, although particular 
sequences favor initial loading of recA onto a polynudeotide (e.g.. nucleation sequences). The 
nudeoprotein filament(s) can be formed on essentially any DNA molecule and can be formed in cells 
(e.g., mammalian cells), forming complexes with both single-stranded and double-stranded DMA. 
although the loading conditions for dsDNA are somewhat different than for ssDNA. 

pft y^ n^hlnase m ating of Tarofttino PolvnudeotideS 

The conditions used to coat targeting polynudeotides with recombinases such as recA protein and 
ATP(S have been described in commonly assigned U.S.S.N. 07/910.791, filed 9 July 1992; U.S.S.N. 
07/755.462, filed 4 September 1991; and U.S.S.N. 07/520.321, filed 7 May 1990, each incorporated 
herein by reference. The procedures below are direded to the use of E. coli rec^ although as will be 
appreciated by those in the art. other recombinases may be used as well. Targeting polynudeotides 
can be coated using GTP(S. mixes of ATP(S with rATP. rGTP and/or dATP. or dATP or rATP alone in 
the presence of an rATP generating system (Boehringer Mannheim). Various mbrtures of GTP(S. 
ATP(S. ATP. ADP. dATP and/or rATP or other nucleosides may be used, particuiariy preferred are 
mixes of ATP(S and ATP or ATP(S and ADP. 

RecA protein coating of targeting polynudeotides is typically canled out as described in U.S.S.N. 
07/910,791. filed 9 July 1992 and U.S.S.N. 07/755.462. filed 4 September 1991, which are 
incorporated herein by reference. Briefly, the targeting polynucleotide, whether double-stranded or 
single-stianded. is denatured by heating in an aqueous solution at 95-100'C for five minutes, then 
placed in an ice bath for 20 seconds to about one minute followed by centrifugation at O'C for 
approximately 20 sec. before use. When denatured targeting polynudeotides are not placed in a 
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freezer at -lO'C they are usually immediately added to standard recA coating reaction buffer 
containing ATP(S, at room temperature, and to this is added the recA protein. Alternatively. recA 
protein may be included with the buffer components and ATP(S before the polynucleotides are added. 

RecA coating of targeting polynucleotide(s) is initiated by incubating potynucleotide-recA mixtures at 
5 37'C for 10-15 min. RecA protein concentration tested during reaction with polynucleotide \«ries 
depending upon polynucleotide size and the amount of added polynucleotide, and the ratio of recA 
molecule.nucleotide preferably ranges between about 3:1 and 1:3. When single-stianded 
polynucleotides are recA coated independently of their homologous polynucleotide strands, the mM 
and MM concentrations of ATP(S and recA. respectively, can be reduced to one-half those used with 
0 double-stranded targeting polynucleotides (i.e., recA and ATP(S concentration ratios are usually kept 
constant at a specific concentration of individual polynucleotide strand, depending on whether a single- 
or double-stranded polynucleotide is used). 

RecA protein coating of targeting polynucleotides is normally carried out in a standard 1X RecA 
coating reaction buffer. 10X RecA reaction buffer (l.e.. lOx AC buffer) consists of: 100 mM Tris 
5 acetate (pH 7.5 at 37'C). 20 mM magnesium acetate. 600 mM sodium acetate. 10 mM DTT, and 50% 
glycerol). All of the targeting polynucleotides, whether double-stranded or single^stranded. typically 
are denatured before use by heating to 95-100-C for five minutes, placed onice for oneminute. and 
subjected to centrifugation (10.000 rpm) at OX for approximately 20 seconds (e.g., in a Tomy 
centrifuge). IDenatured targeting polynucleotides usually are added immediately to room temperature 
RecA coating reaction buffer mixed with ATP{S and diluted with buffer or double-distilled HjO as 
necessary. 

A reaction mixture typically contains the.fbllowing components: (i) 0.2-4.8 mM ATP(S; and (ii) between 
1-100 ng/pl of targeting polynucleotide. To ttiis mixture is added about 1 -20 pi of recA protein per 1 0- 
100 pi of reaction mixture, usually at about 2-10 mg/ml (purchased from Pharmacia or purified), and is 
rapidly added and mixed. The final reaction volume-fbr RecA coating of targeting polynucleotide is 
usually in the range of about 10-500 pi. RecA coating of targeting polynucleotide is usually initiated by 
incubating targeting polynucleotide-RecA mixtures at 37'C for about 10-1 5 mIn. 

RecA protein concentrations in coating reactions varies depending upon targeting polynucleotide size 
and the amount of added targeting polynucleotide: recA protein concentrations are typically in the 
range of 5 to 50 pM. When single-stranded targeting polynucleotides are coated with recA. 
independently of their complenrentary strands, the concentrations of ATP(S and recA protein may 
optionally be reduced to about one-half of the concentrations used with double-stranded targeting 
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polynucleotides of the same length: that Is, the recA protein and ATP(S concentration ratios are 
generally kept constant for a given concentration of individual polynucleotide strands. 

The coating of targeting polynucleotides with recA protein can be evaluated in a numl>er of \ways. 
First, protein binding to DNA can be examined using band-shift gel assays (McEntee et al.. (1981) i 
5 Biol. Chem. 256: 8835). Labeled polynucleotides can be coaled with recA protein in the presisnce of 
ATP(S and the products of the coating reactions may be separated by agarose gel electrophoresis. 
Following incubation of recA protein with denatured duplex DNAs the recA protein effectively coats 
single-stranded targeting polynucleotides derived frpm denaturing a duplex DNA. As the ratio of recA 
protein monomers to nucleotides in the targeting polynucleotide increases from 0, 1:27. 1:2.7 to 3.7:1 
10 for 121-mer and 0, 1:22. 1:2.2 to 4.5:1 for 159-mer. targeting polynucleotide's electrophoretic mobility 
decreases, i.e., is retarded, due to recA-binding to the targeting polynucleotide. Retardation of the 
coated polynucleotide's mobility reflects the saturation of targeting polynucleotide with recA protein. 
An excess of recA monomers to DNA nucleotides is required for efficient recA coating of short 
targeting polynucleotides (Leahy et al.. (1986) J. Biol. Chem. 2&1: 954). 

15 A second method for evaluating protein binding to DNA is in the use of nitrocellulose filter binding 
assays (Leahy et al., (1986) J. Biol. Chem. 2§1:6954; Woodbury, et al.. (1983) PiQQhgmiStfY 
22(20):4730-4737. The nitrocellulose filter binding method is particularly useful in determining the 
dissociation-rates for proteinrDNA complexes using labeled DNA. In the filter binding assay, 
DNA:protein complexes are retained on a filter while free DNA passes through the filter. This assay 

20 method is more quantitative for dissociation-rate determinations because the separation of 
DNA:protein complexes from free targeting polynucleotide is very rapid. 

Alternatively, recombinase protein(s) (prokaryotic, eukaryotic or endogeneous to the target cell) may 
be exogenousiy induced or administered to a target cell simultaneously or contemporaneously (i.e., 
within about a few hours) with the targeting polynucleotide(s). Such administration is typically done by 

25 micro-injection, although electroporation. lipofection. and other transfection methods known in the art 
may also be used. Alternatively, recombinase-proteins may be produced in ylsffl. For example, they 
may be produced from a homologous or heterologous expression cassette in a transfected cell or 
transgenic cell, such as a transgenic totipotent cell (e.g. a fertilized zygote) or an embryonal stem cell 
(e.g., a murine ES cell such as AB-1) used to generate a transgenic non-human animal line or a 

30 somatic cell or a pluripotent hematopoietic stem cell for reconstituting all or part of a particular stem 
cell population (e.g. hematopoietic) of an individual. Conveniently, a heterologous expression cassette 
includes a modulatable promoter, such as an ecdysone-inducible promoter-enhancer combination, an 
estrogen-induced promoter-enhancer combination, a CMV promoter-enhancer, an insulin gene 
promoter, or other cell-type specific, developmental stage-specific, hormone-inducible. or other 
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modulatable promoter construct so that expression of at least one species of recombinase protein 
from the cassette can by modulated for transiently producing recombinase(s) M simultaneous or 
contemporaneous with Introduction of a targeting polynucleotide into the cell. When a 
homione-induclble promoter^nhancer combination Is used, the cell must have the required hormone 
receptor present, either naturally or as a consequence of expression a co-transfected expression 
vector encoding such receptor. Alternatively, the recombinase may be endogeneous and produced ii 
high levels. In this embodiment, preferably in eukaryotic target cells such as tumor cells, the target 
cells produce an elevated level of recombinase. In other embodiments the level of recombinase may 
be induced by DNA damaging agents, such as mitomycin C. UV or (nrradiation. Alternatively, 
recombinase levels may also be elevated by transfection of a virus or plasmid encoding the 
recombinase gene into the cell. 



in 



Cell-Uptake Components 

A targeting polynucleotide of the Invention may optionally be conjugated, typically by covalently or 
preferably noncovalent binding, to a cell-uptake component. Various methods have been described In 

15 theartfbrtargetlngDNAtospeclficcelltypes. A targeting polynucleotide of the invention can be 
conjugated to essentially any of several celkiptake components known in the art. For targeting to 
hepatocytes. a targeting polynucleofide can be conjugated to an asialoorosomucold (ASQRHwIy-L- 
lysine conjugate by methods described in the art and incorporated herein by reference (Wb GY and 
Wu CH (1987) J^JioLCheoL 222:4429; Wii GY and Wu CH (1988) fiiQjaiemjs!a2Z:887: Wu GY and 

20 WuCH(1988)J.£i2LQiigaL2S2: 14621; WuGY and WuCH (1992) UioLCbgnLZfiZ: 12436:WU 
et al. (1991) I Bipl, Chf m m. 14338; and Wilson et al. (1992) J. Biol. Ch^m 9fi7- qr-^ 
WO92/06180; WO92/05250; and W091/17761. which are Incorporated herein by reference). 

Altematively. a cell-uptake component may be fbm«d by incubating the targeting polynucleotide with 
at least one lipid species and at least one protein species to form protein-llpld-polynucleotide 
25 complexes consisting essentially of the targeting polynucleotide and the iipkJ-protein celkiptake 
component. Lipid vesicles made according to Feigner (W091/17424. Incorporated herein by 
reference) and/or cationic lipidization (WO91/16024. incorporated herein by reference) or other fbm« 
for polynucleotide administration (EP 465.529. incorporated herein by reference) may also be 
employed as cell-uptake components. Nucleases may also be used. 

30 In addltk>n to cell-uplake components, targeting components such as nuclear localizatfon signals may 
be used, as is known In the art 
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In addition to recombinase and cellular uptake components, the targeting polynucleotides may include 
chemical substituents. Exogenous targeting polynucleotides that have been modified with appended 
chemical substituents may be introduced along with recombinase (e.g.. recA) into a metabolicalty 
active target cell to homologously pair with a predetermined endogenous DNA target sequence in the 

5 cell. In a preferred embodiment, the exogenous targeting polynucleotides are derivatized. and 
additional chemical substituents are attached, either during or after polynucleotide synthesis, 
respectively, and are thus localized to a specific endogenous target sequence where they produce an 
alteration or chemical modification to a local DNA sequence. Preferred attached chemical substituents 
include, but are not limited to: cross-linking agents (see Podyminogin et al., Biochem. 34:13098 

10 (1995) and 35:7267 (1996). both of which are hereby incorporated by reference), nucleic acid 
cleavage agents, metal chelates (e.g.. iron/EDTA chelate for iron catalyzed cleavage), 
topoisomerases, endonucleases. exonucleases, ligases. phosphodiesterases, photodynamic 
porphyrins, chemotherapeutic drugs (e.g., adriamycin. doxirubicin). intercalating agents, labels, base- 
modification agents, agents which normally bind to nuclefc acids such as labels, etc. (see for example 

15 Afonina et al.. PNAS USA 93:3199 (1996), incorporated herein by reference) immunoglobulin chains, 
and oligonucleotides. Iron/EDTA chelates are particularly prefenred chemrcal substituents where tocal 
cleavage of a DNA sequence Is desired (Hertzberg et al. (1982) J. Am. Ch^m. Sqc. 1Q4: 313; 
Hertzberg and Dervan (1984) BlQchemtstrv 23: 3934; Taylor et al. (1984) Tetr9h?dron 4Q: 457; 
Dervan. PB ( 1986) Science 232 : 464, which are incorporated herein by reference). Further preferred 

20 are groups that prevent hybridization of the complementary single stranded nucleic acids to each other 
but not to unmodified nucleic acids; see for example Kutryavin et al., Biochem. 35:1 1 170 (1996) and 
Woo etal.. Nucleic Acid. Res. 24(13):2470 (1996). both of which are incorporated by reference, 2'-0 
methyl groups are also preferred: see Cole-Strauss et al„ Science 273:1386 (1996); Yoon et al.. 
PNAS 93:2071 (1996)). Additional preferred chemical substitutents include labeling moieties. 

25 including fluorescent labels. Prefenred attachment chemistries include: direct linkage. e.g.. via an 
appended reactive amino group (Corey and Schultz (1988) SsifiDCfi 238:1401. which is Incorporated 
herein by reference) and other direct linkage chemistries, although streptavidin/biotin and 
digoxigenln/antidigoxigenin antibody linkage methods may also be used. Methods for linking chemical 
substituents are provided in U.S. Patents 5.1 35.720. 5,093.245. and 6,055.556. which are 

30 incorporated herein by reference. Other linkage chemistries may be used at the discretion of the 
practitioner. 

Typically, a targeting polynucleotide of the invention is coated with at least one recombinase and is 
conjugated to a cell-uptake component, and the resulting cell targeting complex is contacted with a 
target cell under uptake conditions (e.g.. physiological conditions) so that the targeting polynudeotkle 
35 and the recombinase(s) are internalized in the target cell. A targeting polynucleotide may be 

contacted simultaneously or sequentially with a cell-uptake component and also with a recombinase; 
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preferably the targeting polynucleotide is contacted first with a recombinase, or with a mixture 
comprising both a ceilniptake component and a recombinase under conditions whereby, on average, 
at least about one molecule of recombinase is honcovalently attached per targeting polynucleotide 
molecule and at least about one celkjptake component also is noncovalently attached. Most 
preferably, coating of both recombinase and cell-uptake compbnent saturates essentially all of the 
available binding sites on the targeting polynucleotide. A targeting polynucleotide may be 
preferentially coated with a cell-uptake component so that the resultant targeting complex comprises, 
on a molar basis, more cell-uptake component than recombinase(s). Alternatively, a targeting 
polynucleotide may be preferentially coated with recombinase(s) so that the resultant targeting 
complex comprises, on a molar basis, more recombinase(s) than cell-uptake component. 

Cell-uptake components are included with recombinase-coated targeting polynucleotides of the 
inventfon to enhance the uptake of the recombinaseHX)ated targeting polynucleotide(s) into cells, 
particularly fbrio mq gene targeting applications, such as gene therapy to treat genetic diseases, 
including neoplasia, and targeted homologous recombinatun to treat viral infections wherein a viral 
sequence (e.g.. an integrated hepatitis B virus (HBV) genome or genome fragment) may be targeted 
by homologous sequence tergeting and Inactivated. Alternatively, a tergeting polynucleotide may be 
coated with the cell-upteke component and tergeted to cells with a contemporaneous or simultaneous 
administration of a recombinase (e.g.. liposomes or immunoliposomes containing a recombinase. a 
viral-based vectpr encoding and expressing a recombinase). 

3 Once the recombinase-tergeting polynucleotide compositions are formulated, they are introduced or 
administered Into target cells. The administiatton is typically done as is known for the administration of 
nuclefe acids into cells, and. as those skilled in the art will appreciate, the methods may depend on the 
chotee of the target cell. Suitable methods include, but are not limited to. mteroinjectton. 
electroporation. lipofection. ete. By Aterget cells® herein is meant prokaryotic or eukaiyotic cells. 
Suitable prokaryotic cells include, but are not limited to, bacteria such as £ coff. Bacillus species, and 
the extremophile bacteria such as thermophiles. ete. Preferably, the procaryotic target cells are ' 
recombination competent. Suitable eukaryotic cells include, but are not limited to. fungi such as yeast 
and filamentous fungi, including species of Aspergillus, Trichoderma, and Neurospora; plant cells 
including those of com. sorghum, tobacco, canola. soybean, cotton, tomato, potato. pHalfe. sunflower, 
ete.; and animal cells, including fish, birds and mammals. Suitable fish cells include, but are not 
limited to. those from species of salmon, trout, tulapia. tuna. carp, flounder, halibut, swordfish. cod and 
zebrafish. Suitable bird cells include, but are not limited to. those of chickens, ducks, quail, pheasants 
and turtceys. and other jungte fowl or game birds. Suttabte mammalian cells Include, but are not 
limited to. cells from horses, cattle, buffalo, deer, sheep, rabbits, rodente such as mice. rate, hamsters 
gerbils. and guinea pigs, minks, goate. pigs, primates, marsupials, marine mammals including dolphins 
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and whales, as well as cell lines, such as human cell lines of any tissue or stem cell type, and stem 
cells, including pluripotent and non-pluripotent, and non-human zygotes. 

In a prefen-ed embodiment, procaryotic cells are used. In this embodiment, a pre-seiected target DNA 
sequence is chosen for alteration. Preferably, the pre-selected target DNA sequence is contained 
5 within an extrachromosomal sequence. By Aextrachromosomal sequence® herein is meant a 
sequence separate from the chronwsomal or genomic sequences. Prefen^d extrachromosomal 
sequences include plasmids (particularly procaryotic plasmids such as bacterial plasmids), P1 vectors, 
viral genomes, yeast, bacterial and mammalian artificial chromosomes (YAC, BAC and MAC, 
respectively), and other autonomously self-replicating sequences, although this is not required. As 

10 described herein, a recombinase and at least two single stranded targeting polynucleotides which are 
substantially complementary to each other, each of which contain a homology clamp to the target 
sequence contained on the extrachromosomal sequence, are added to the extrachromosomal 
sequence, preferably in vitro. The two single stranded targeting polynucleotides are preferably coated 
with recombinase, and at least one of the targeting polynucleotides contain at least one nucleotide 

15 substitution, insertion or deletion. The targeting polynucleotides then bind to the target sequence in 
the extrachromosomal sequence to effect homologous recombination and form an altered 
extrachromosomal sequence which contains the substitution, insertion or deletion. The altered 
extiBchromosomal sequence is then introduced into the procaryotic cell using techniques known in the 
art Preferably, the recombinase is removed prior to introduction into the target cell, using techniques 

20 known in ttie art. For example, ttie reaction may be treated witii proteases such as proteinase K, 
detergents such as SDS, and phenol extraction (including phenol:chloroform:isoannyl alcohol 
extraction). These metiiods may also be used for eukaryotic cells. 

Alternatively, tiie pre-selected target DNA sequence is a chromosomal sequence. In ttiis embodiment, 
the recombinase with the targeting polynucleotides are introduced into the target cell, preferably 
25 eukaryotic target cells. In tiiis embodiment, it may be desirable to bind (generally non-covalenUy) a 
nuclear localization signal to the targeting polynucleotides to facilitate localization of the complexes In 
the nucleus. See for example Kido et al.. Exper. Cell Res. 198:107-114 (1992), hereby expressly 
incorporated by reference. The targeting polynucleotides and the recombinase function to effect 
homologous recombination, resulting in altered chromosomal or genomic sequence^. 

30 In a prefened embodiment, eukaryotic cells are used. For making transgenic non-human animals 
(which include homologously targeted non-human animals) embryonal stem cells (ES cells) and 
fertilized zygotes are preferred. In a preferred embodiment, embryonal stem cells are used. Murine 
ES cells, such as AB-1 line grown on mitotically inactive SNL76/7, cell feeder layers (McMahon and 
Bradley. CsllfiZ: 1073-1085 (1990)) essentially as described (Robertson. E.J. (1987) in 
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Teratocarcinomgs gnd Embryonic Stem Ceils: A Practical Approach . E.J. Robertson, ed. (oxford: IRL 
Press), p. 71-1 12) may be used for homologous gene targeting. Other suitable ES lines include, but 
are not limited to. the E14 line (Hooper et at. (1987) Nature 326 : 292-295). the D3 line (Doetschman et 
al. EmbfvoL Ext^ Morph 1^7- 21-45). and the CCE line (Robertson et al. f1986^ Nature 323: 

445-448). The success of generating a mouse line from ES cells bearing a specific targeted mutation 
depends on the pluripotence of the ES cells (i.e., their ability, once injected into a host blastocyst, to 
participate in embryogenesis and contribute to the germ cells of the resulting animal). 

The pluripotence of any given ES cell line can vary with time in culture and the care with which it has 
been handled. The only definitive assay for pluripotence is to determine whether the specific 
population of ES cells to be used for targeting can give rise to chimeras capable of germline 
transmission of the ES genome. For this reason, prior to gene targeting, a portion of the parental 
population of AB-1 ceils is injected into C57B1/6J blastocysts to ascertain whether the cells are 
capable of generating chimeric mice with extensive ES cell contribution and whether the majority of 
these chimeras can transmit the ES genome to progeny. 

In a preferred embodiment non-human zygotes are used, for example to make transgenic animals, 
using techniques known In the art (see U.S. Patent No. 4.873.191). Preferred zygotes include, but are 
not limited to, animar zygotes, including fish, avian and mammalian zygotes. Suitable fish zygotes 
include, but are not limited to, those from species of salmon, trout, tuna, carp, flounder, halibut, 
swordfish, cod, tulapia and zebrafish. Suitable bird zygotes include, but are not limited to. those of 
chickens, ducks, quail, pheasant, turkeys, and other jungle fowl and game birds. Suitable mammalian 
zygotes Include, but are not limited to, cells from horses, cattle, buffalo, deer, sheep, rabbits, rodents 
such as mice, rats, hamsters and guinea pigs, goats, pigs, primates, and marine mammals including 
dolphins and whales. See Hogan et al.. Manipulating the Mouse Embryo (A Laboratory Manual). 2nd 
Ed. Cold Spring Harbor Press. 1994. incorporated by reference. 

Once made and administered to a target host cell, the compositions of the invention find use In a 
number of applications, including the creation of transgenic plants and animals. Such transgenic 
animals can be any of the animals, fish and birds outlined above as suitable for zygotes. Preferably 
the transgenic animals are mammals, including, but not limited to, fanm animals such as cattle, buffalo, 
goats, including BELE® goats, sheep, and pigs or other transgenic animals such as mice, rabbits, 
monkeys, etc. In a preferred embodiment, the animals or mammals are non-human. 

In general, transgenic animals are made with any number of changes. Exogeneous sequences, or 
extra copies of endogeneous sequences, including structural genes and regulatory sequences, may 
be added to the animal, as outlined below. Endogeneous sequences (again, either genes or 
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regulatory sequences) may be disrupted, i.e. via insertion, deletion or substitution, to prevent 
expression of endogeneous proteins. Alternatively, endogeneous sequences may be modified to alter 
their biological function, for example via mutatbn of the endogeneous sequence by insertion, deletion 
or substitution. 

5 Accordingly. tThe methods of the present invention are useful to add exogenous DNA sequences, 
such as exogenous genes or regulatory sequences, extra copies of endogenous genes or regulatory 
sequences, or exogeneous genes or regulatory sequences, to a transgenic plant or animal. This may 
be done for a number of reasons: for example, adding one or more copies of a wild-type gene can 
increase the production of a desirable gene product; adding or deleting one or more copies of a 

10 therapeutic gene can alleviate a disease state, or to create an animal model of disease. Adding one or 
more copies of a modified wild type gene may be done for the same reasons. Adding therapeutic 
genes or proteins may yield superior transgenic animals, for example for the production of therapeutic 
or nutriceutical proteins. Adding human genes to non-human noammals may facilitate production of 
human proteins and adding regulatory sequences derived from human or non-human mammals may 

1 5 be useful to increase or decrease the expression of endogenous or exogenous genes. Such inserted 
genes may be under the control of endogenous or exogenous regulatory sequences, as described 
herein. 

The methods of the invention are also useful to modify endogeneous gene sequences, as outlined 
below. Suitable endogenous gene targets Include, but are not limited to, genes which encode 

20 peptides or proteins including enzymes, structural or soluble proteins, as well as endogeneous 
regulatory sequences including, but not limited to. promoters, transcriptional or translational 
sequences, repetitive sequencs including oligo[d(A-C)n •d(G-T)J, oligo[d(A-T)ln, oligo[d(C-T)]„. etc. 
Examples of such endogenous gene targets include, but are not limited to. genes which encode 
lactoglobulins including both a-lactoglobulin and $-lactoglobulin; casein, including both a-casein, 

25 IJ^sein and K-casein; albumins, including serum albumin, particulariy human and bovine; 
immunoglobulins, including IgE. IgM. IgG and IgDand monoclonal antibodies; gtobin; integrin; 
honnones; growth factors, particulariy bovine and human growth factors, including transforming growth 
factor, epidermal growth factor, nerve growth factors, etc.; collagen; interieukins. Including IL-1 to IL- 
17; a major histocompatibility antigen (MHC); G-protein coupled receptors (GPCR); nuclear receptors; 

30 Ion channels; multidrug resistance genes; amyloid proteins; enzymes, including esterases, proteases 
(including tissue plasminogen activator (tPA)). lipases, carbohydrases, etc.; APRT. HPRT; leptin; 
tumor suppressor genes; provirus; prions; OTC; CFTR; sugar transferases such as alpha-^alactosyl 
transferase (galT) or fucosyl transferase; a milk or urine protein gene including the caseins, lactofenin 
and whey proteins; oncogenes; cytokines, particulariy human; transcription factors; and other 
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pharmaceuticals. Any or all of these may also be suable exogeneous genes to add to a genome 
using the methods outlined herein. 

Endogeneous genes (or regulatory sequences, as outlined herein) may be modified In several ways 
including disruptions and alterations. 

The endogenous target gene may be disrupted in a variety of ways. The term Adisrupt® as used 
herein comprises a change in the coding or non^ing sequence of an endogenous nucleic acW that 
a«ers the transcription ortranslation of an endogenous gene. In a preferred embodiment, a disrupted 
gene will no longer produce a functional gene product. Genemlly. disruption may occur by either the 
insertion, deletion or frame shifting of nucleotides. 

10 The tem, Ainsertion sequence® as used herein means one or more nucleotides which are inserted 
into an endogenous gene to disrupt it In general, insertion sequences can be as short as 1 nucleotide 
or as long as a gene, as outlined below. For non-gene insertion sequences, the sequences are at 

least 1 nucleotide. With from about 1 to about 50 nucleotides being preferred, and fiom about ^ 
r,uc,eotides being particulariy preferred. An insertion sequence may comprise a polylinker sequence 
15 with from about 1 to about 50 nucleotWes being preferred, and from about 10 to 25 nucleotides being 

particularly preferred. 

In a preferred embodiment an insertion sequence comprises a gene which not only disrupts the 
endogenous gene, thus preventing its expression, but also can result in ttie expression of a new gene 
product Thus, in a preferred embodiment the disruption otan endogenous gene by an insertion 

20 sequence gene is done in such a manner to allow the transcription and translation of the insertion 
gene. An insertion sequence that encodes a gene may range from about 50 bp to 5000 bp of cDNA or 
about 5000 bp to 50000 bp of genomic DNA. As will be appreciated by those in the art this can be 
done in a variety of ways. In a preferred embodiment the insertion gene is targeted to the 
endogenous gene in such a manner as to utilize endogenous legulatory sequences. Including 

25 promoters, enhancers or a regulatory sequence. In an alternate embodiment the insertion sequence 
gene includes its own regulatory sequences, such as a promoter, enhancer or otiier regulatory 
sequence etc. . 



30 



Particulariy preferred insertion sequence genes indude. but are not limited to. genes which encode 
therapeutic and nutriceutical proteins, and reporter genes. Suftable insertion sequence genes which 
may be inserted into endogenous genes include, but are not limited to. nucleic acids which encode 
those genes listed as suitable endogeneous genes for alterations, above, particulariy mammalian 
enzymes, mammalian antibodies, mammalian proteins including semm albumin as well as mammalian 
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therapeutic genes. In a preferred embodiment, the inserted mammalian gene is a human gene. 
Suitable reporter genes are those genes which encode detectable proteins, such as the genes 
encoding luciferase. P-galactosidase (both of which require the addition of reporter substrates), and 
the fluorescent proteins, including green fluorescent protein (GFP). blue fluorescent protein (BFP), 
5 yellow fluorescent protein (YFP), and red fluorescent protein (RFP). 

Thus, in a preferred embodiment, the targeted sequence modification creates a sequence that has a 
biological activity or encodes a polypeptide having a biological activity. In a preferred embodiment, the 
polypeptide is an enzyme with enzymatic activity. In another preferred embodiment, the polypeptide is 
an antibody. In a third preferred embodiment, the polypeptide is a structural protein. 

1 0 In addition, the insertion sequence genes may be modified or variant genes, i.e. they contain a 

mutation from the wild-type sequence. Thus, for example, modified genes including, but not limited to, 
improved therapeutic genes, modified "-lactalbumin genes that do not encode any phenylalanine 
residues, or human enzynie or human antibody genes that do not encode any phenylalanine residues. 



The term Adeletion® as used herein comprises removal of a portion of the nucleic acid sequence of 
1 5 an endogenous gene. Deletions range from about 1 to about 100 nucleotides, with from about 1 to 60 
nucleotides being preferred and from about 1 to about 26 nucleotides being particularly preferred, 
although in some cases deletions may be much larger, and may effectively comprise the removal of 
the entire endogenous gene and/or its regulatory sequences. Deletions may occur in combination with 
substitutions or modifications to arrive at a final modified endogenous gene. 

20 In a prefen-ed embodiment, endogenous genes may be disrupted simultaneously by an insertion and a 
deletion. For example, some or all of an endogenous gene, with or without its regulatory sequences, 
may be removed and replaced with an insertion sequence gene. Thus, for example, all but the 
regulatory sequences of an endogenous gene may be removed, and replaced with an Insertion 
sequence gene, which is now under the control of the endogenous gene's regulatory elements. 

25 The term Aregulatory element® is used herein to describe a non-coding sequence which affects the 
transcription or translation of a gene including, but are not limited to. promoter sequences, ribosomal 
binding sites, transcriptional start and stop sequences, translational start and stop sequences, 
enhancer or activator sequences, or dimerizing sequences. In a preferred embodiment, the regulatory 
sequences include a promoter and transcriptional start and stop sequence. 
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Promoter sequer,ces encode either constitutive or indudbie promoters. The promoters may be either 
naturally occurring promoters or hybrid promoters. Hybrid promoters, which combine elements of 
more than one promoter, are also known in the art. and are useful in the present invention. 

In addition to disrupting endogeneous genes, the endogeneous genes may be altered by substitutions 
5 insertions or deletions of nucleotides that do not completely eliminate the biological function of the ' 
sequence, but rather alter it That is. targeted gene modifications may be made to alter gene function 
For example, defective genes may be fixed, or the activity of a gene may be modulated either 
.ncreasing or decreasing the activity of the sequence (either the nucleic add sequence, for example in 
the case of regulatory nucleic add. or of the gene product. i.e. the amino add sequence of the pK>tein 
10 may be altered). 

The methods of the present invention are useful to provide methods for fully or partially modifying 
endogenous regulatory sequences. Suitable targets for such fully or partially modified regulatory 
sequences include, but are not limited to. regulatory sequences that regulate any of the suitable 
endogeneous genes listed above, with preferred embodiments altering the endogeneous regulatory 
sequences that control the genes which encode --lactoglobulin. $-lactoglobulin. casein, a-casein P- 
casem. K^sein. serum albumin, globin. IgG. integrin. lactofenln. a retroviral provirus a prion 
alpha-galactosyl transferase (galT). a sugar transferase or a milk or urine producHon gene Examples 
of such fully or partially modified endogenous regulatory sequences indude. but are not limited to a 
modified regulatory element for an endogenous gene, a modified transcriptional regulation cassette or 
start site for an endogenous gene, a modified promoter, transcription initiation site, or enhancer 
sequences. 



15 



20 



When the modifk:ation of the endogeneous gene is to alter a structural gene, generally amino add 
dianges will be made as is known in the art Substitutions, deletions, insertions or any combination 
thereof may be used to arrive at a final derivative. General^ these d^riges are done on a few amino 
25 adds to minimize the alteration of the molecule. However, larger dianges may be tolerated in certein 
circumstances or for certain purposes. When small alterations in the characteristics of the 
endogeneous protein are desired, substitutions are genemlbr made in accordance with the following 
chart: ^ 

Ala 

Arg Ser 
35 CyS Z 
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Uic 


Acn r^ln 

ASlit will 




Leu Val 




lie Val 




Ara G\n Glu 

Wlllf 


Met 


Leu, lie 


Phe 


Met, Leu. Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp. Phe 


Val 


lie. Leu 



Substantial changes in function or immunological identity are made by selecting substitutions that are 
less conservative than those shown in Chart I. For example, substitutions may be made which more 
significantly affect the structure of the polypeptide backbone in the area of the alteration, for example 
the a-hellcal or p-sheet structure: the charge or hydrophobicity of the molecule at the target site; or the 
5 bulk of the side chain. The substitutions which in general are expected to produce the greatest 
changes in the polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or 
threonyl. is substituted for (or by) a hydrophobic residue, e.g. leucyi, isoleucyt. phenyialanyl. valyl or 
alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain, e.g. lysyl. arginyl, or histidyl, is substituted for (or by) an electronegative 
10 residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 

Prefenred embodiments of the present invention Include, but are not limited to: (1) a farm animal 
including cattle, sheep, pigs, horses and goats with a 1-25 base pair deletion, or a 10-25 base pair 
insertion of a polyiinker sequence, or insertion of a reporter gene such as a luciferase gene, a 

15 galactosidase gene or a green fluorescent (GFP) protein gene in an endogenous gene or sequence 
encoding omithine transcarbamylase (OTC), lactoglobulin, casein, p-casein. a-casein, K-casein, 
albumin, globin, immunoglobulin, IgG. interleukin. a sugar transferase, integrin, a milk protein, a urine 
protein, a retroviral provirus, an endogenous virus, a prk>n, a leptin. or cystic fibrosis transmembrane 
regulator (CFTR); (2) a fami animal including cattle, sheep, pigs, horses and goats with an exogenous 

20 gene such as a gene encoding human lysozyme, human growth hormone, human serum albumin. 
« human globin, a human antibody (human IgG), a tissue plasminogen activator, a human therapeutic 
protein, human lactase, a human lipase, a homione receptor gene, a viral receptor gene, a G-protein 
coupled receptor gene, a drug or a human enzyme gene, including for example the human lysozyme 
gene, the human a-1 anti-trypsin gene, the human anti-thrombin III gene; (4) a farm animal including 

25 cattle, sheep, pigs, horses and goats with a modified endogenous repeated (A-C)„ sequence, a 
modified repeated (A-G)„ sequence, a modified repeated (A-T)n sequence, a modified endogenous 
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CFTR gene or a modified endogenous OTC gene; (5) a farm animal including cattle, slieep. pigs, 
horses and goats with a modified "-lactoglobulin gene or $-lactoglobulin gene does not encode any 
phenylalanine residues; (6) a farm animal including cattle, sheep, pigs, horses and goats with a human 
monoclonal antibody gene, or a gene for a human antibody that does not encode any phenylalanine 
residues, for example inserted (or replacing) in the endogenous gene or sequence encoding an 
immunoglobulin, or IgG; and (7) a ferm animal including cattle, sheep, pigs, horses and goats with a 
human gene under control of its endogenous promoter, a modified endogenous regulatory element for 
an endogenous gene which may or may not be disrupted by an insertion sequence, a transcriptional 
regulation cassette ord a dimerizing sequence. Specific preferred embodiments also Include, a farm 
animal including cattle, sheep, pigs, horses and goats with an endogenous regulatory element which is 
disrupted by, deletion of at least one nucleotide. 

Additional preferred embodiments comprise a pig. monkey or cow with a 1-25 to 1-50 base pair 
insertion, examples of which include a hormone receptor gene, a viral receptor gene or a G-protein 
coupled receptor gene, or a 1-25 to 1-50 bp deletion in a sugar transferase gene including the a- 
galactosyl transferase gene (galT) or the fucosyl transferase gene, a BELB8) goat with a human gene, 
and a pig. goat, sheep or cow with a 1-25 base pair insertion or a 1-25 base pair deletion In a 
endogenous retroviral provirus gene such as deletion of the sequence for proviral KC. Further specific 
preferred embodiments include, a cow with a modified milk production gene such as. a cow with a 
lactase gene insertion in a milk promoter, a cow with the human lactoferrin gene replacing the bovine 
lactofemn gene, a monkey with a human therapeutic gene, or a human antibody gene, a cow with the 
human lipase gene in a milk promoter, a cow with a human gene placed in a transcription initiation site 
of a milk gene under the control of its endogenous promoter, a cow with a human gene placed in a 
transcription initiation site of a globin gene under the control of its endogenous globin gene promoter, a 
cow and goat with a modified urine protein gene, a mammal with a modified endogenous leptin gene, a 
modified endogenous OTC gene, a modified endogenous CFTR gene or a modified interieukin gene. 
Additional preferred embodiments include an animal such as a mouse, rabbit or goat with a 
transcriptional regulation cassette inserted in the transcriptional start site of an integrin gene, and a 
mouse with a modification in the integrin gene or G-protein coupled receptor gene. 

The vectors containing the DNA segments of interest can be transferred into the host cell by well- 
known methods, depending on the type of cellular host. For example, micro-injection is commonly 
utilized for target cells, although calcium phosphate treatment, electroporation, lipofection, biolistics or 
viral-based transfection also may be used. Other methods used to transform mammalian cells include 
the use of Polybrene, protoplast fusion, and others (sgg. Generally. Sarnbrook et al. Molecular Cloning: 
A Laboratory Manual, 2d ed., 1989, CoW Spring Harbor Laboratory Press, Cold Spring Harbor. N.Y.. 
whfch is incorporated herein by reference). Direct injection of DNA and/or recombinase-coated 
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targeting polynucleotides into target cells, such as sl^eletal or muscle cells also may be used (Wolff et 
al. (1000) Science 247 : 1465. which is incorporated herein by reference). 

Tarofttina of Endogenous HNA Senuences 

Once made and administered to a target host cell, the compositions of the invention find use in a 
number of applications, including the site directed modification of endogeneous sequences within any 
target cell, the creation of transgenic plants and animals, and the use of the compositions to do 
site-directed mutagenesis or modifications of target sequences. 

Generally, any predetermined endogenous DNA sequence, such as a gene sequence, can be altered 
by homologous recombination (which includes gene conversion) with an exogenous targeting 
polynucleotides (such as a complementary pair of single-stranded targeting polynucleotides). The 
target polynucleotides have at least one homology clamp which substantially corresponds to or is 
substantially complementary to a predetermined endogenous DNA target sequence and are 
introduced with a recombinase (e.g.. recA) into a target cell having the predetermined endogenous 
DNA sequence. Typically, a targeting polynucleotide (or complementary polynucleotide pair) has a 
portion or region having a sequence that is not present in the preselected endogenous targeted 
sequence(s) (i.e., a nonhomologous portion or mismatch) which rnay be as snrall as a single 
mismatched nucleotide, several mismatches, or may span up to about several kilobases or more of 
nonhomologous sequence. Generally, such nonhomologous portions are flanked on each side by 
homology clamps, although a single flanking homology clamp may be used. Nonhomologous portions 
are used to make inserttons, deletions, and/or replacements in a predetermined endogenous targeted 
DNA sequence, and/or to make single or multiple nucleotide substitutions in a predetermined 
endogenous target DNA sequence so that the resultant recombined sequence (i.e.. a targeted 
■ recombinant endogenous sequence) incorporates some or all of the sequence infomiation of the 
nonhomologous portion of the targeting polynucleotide(s). Thus, the nonhomologous regions are used 
to make variant sequences, i.e. targeted sequence modifications. Addittons and deletions may be as 
small as 1 nucleotide or may range up to about 2 to 4 kilobases or more. In this way. site directed 
modifications may be done In a variety of systems for a variety of purposes. 

In a preferred application, a targeting polynucleotide is used to repair a mutated sequence of a 
structural gene by replacing it or converting it to a wild-type sequence (e.g.. a sequence encoding a 
protein with a wild-type biological activity). For example, such applications could be used to convert a 
sickle cell trait allele of a hemoglobin gene to an allele whfch encodes a hemoglobin molecule that is 
not susceptible to sickling, by altering the nucleotide sequence encoding the $-subunil of hemoglobin, 
so that the codon at position 6 of the $-subunit is converted fromVal$6->Glu$6 (Shesely et al. (1991) 
op cit.^ . Other genetic diseases can be corrected, either partially or totally, by replacing, inserting. 
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and/or deleting sequence Information In a disease allele using appropriately selected exogenous 
targeting polynucleotides. For example but not for limitation, the )F508 deletion in the human CFTR 
gene can be corrected by targeted homologous recombination employing a recA-coated targeting 
polynucleotide of the invention. 



For many types of jn vivo gene therapy to be effective, a significant number of cells must be correctly 
targeted, with a minimum number of cells having an inconrectly targeted recombination event. To 
accomplish this objective, the combination of: (1) a targeting polynucleotide(5), (2) a recombinase (to 
provide enhanced efficiency and specificity of correct homologous sequence targeting), and (3) a cell- 
uptake component (to provide enhanced cellular uptake of the targeting polynucleotide), provides a 
means for the efficient and specific targeting of cells in yh^o. making in ytyo homologous sequence 
targeting, and gene therapy, practicable. 

Several disease states may be amenable to treatment or prophylaxis by targeted alteration of 
heptocytes in yivo by homologous gene targeting. For example and not for limitation, the following 
diseases, among others not listed, are expected to be amenable to targeted gene therapy: 
hepatocellular carcinoma, HBV Infection, famillai hypercholesterolemia (LDL receptor defect), alcohol 
sensitivity (alcohol dehydrogenase and/or aldehyde dehydrogenase insufficiency), hepatoblastoma, 
Wilson's disease, congenital hepatic porphyrias, inherited disorders of hepatic metabolism, omithine 
transcarbamylase (OTC) alleles, HPRT alleles associated with Lesch Nyhan syndrome, etc. Where 
targeting of hepatic cells io viyg is desired, a cell-uptake component consisting essentially of an 
asialoglycoprotein-poly-L- lysine conjugate is preferred. The targeting complexes of the invention 
which may be used to target hepatocytes jn yjyo take advantage of the significantly increased 
targeting efficiency produced by association of a targeting polynucleotide with a recombinase which, 
when combined with a cell-targeting method such as that of WO92/05250 and/or Wilson et al. (1992) 
J, Biol, Chem. 2g7 :963, provide a highly efficient method for performing mmQ homologous sequence 
targeting in cells, such as hepatocytes. 

In a prefen-ed embodiment, the methods and compositions of the invention.are used for gene 
Inactivation. That is, in addition to correcting disease alleles, exogenous targeting polynucleotides can 
be used to Inactivate, decrease or alter the biological acUvity of one or more genes in a cell (or 
transgenic nonhuman animal). This finds particular use In the generation of animal models of disease 
states, or in the elucidation of gene function and activity, similar to Aknock out® experiments. These 
techniques may be used to eliminate a biological function; for example, a galT gene (alpha galactosyl 
transferase genes) associated with the xenoreactivity of animal tissues in humans may be disrupted to 
form transgenic animals (e.g. pigs) to serve as organ transplantation sources without associated 
hyperacute rejection responses. Alternatively, the biological activity of the wild-type gene may be 
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either decreased, or the wild-type activity altered to mimic disease states. This Includes genetic 
manipulation of non-coding gene sequences that affect the transcription of genes, including, 
promoters, repressors, enhancers and transcriptional activating sequences. 

Once the specific target genes to be modified are selected, their sequences may be scanned for 

5 possible disruption sites (convenient restriction sites, for example). Plasmids are engineered to 

contain an appropriately sized gene sequence with a deletion or insertion in the gene of interest and at 
least one flanking homology damp which substantially corresponds or is substantially complementary 
to an endogenous target DNA sequence. Vectors containing a targeting polynucleotide sequence are 
typically grown in E coli and then isolated using standard molecular biology methods, or may be 

1 0 synthesized as oligonucleotides. Direct targeted inactivation which does not require vectors may also 
be done. When using microinjection procedures it may be preferable to use a transfection technique 
with linearized sequences containing only nrK)dified target gene sequence and without vector or 
selectable sequences. The modified gene site is such that a homologous recombinant between the 
exogenous targeting polynucleotide and the endogenous DNA target sequence can be identified by 

15 using carefully chosen primers and PGR, followed by analysis to detect if PGR products specific to the 
desired targeted event are present (Eriich et al.. (1991) Science 252: 1643. which is incorporated 
herein by reference). Several studies have already used PGR to successfully identify and then done 
the desired transfected cell lines (Zimmer and Gruss, (1989) Nature 338: 150; Moueilic et al., (1990) 
pme Natl Acad. Sd. USA gZ: 4712; Shesely et al.. (1991) Pmc Natl. Acad. Sd. USA fig: 4294. which 

20 are incorporated herein by reference). This approach is very effective when the number of cells 
receiving exogenous targeting polynucleotide(s) is high (i.e., with microinjection, or with liposomes) 
and the treated cell populations are allowed to expand to cell groups of approximately 1x10* cells 
(Capecchi. (1989) Science 244 : 1288). When the target gene is not on a sex chromosome, or the 
cells are derived from a female, both alleles of a gene can be targeted by sequential inactivation 

25 (Mortensen et al., (1 991 ) Proc. Natl. Acad. Sd. USA 88: 7036). 

In addition, the methods of the present invention are useful to add exogeneous DNA sequences, such 
as exogeneous genes or extra copies of endogeneous genes, to an organism. As for the above 
techniques, this may be done for a number of reasons, including: to alleviate disease states, for 
example by adding one or more copies of a wild-type gene or add one or more copies of a therapeutic 

30 gene; to create disease models, by adding disease genes such as oncogenes or mutated genes or 
even just extra copies of a wild-type gene; to add therapeutic genes and proteins, for example by 
adding.tumor suppressor genes such as p53. Rbl . Wt1 . NF1 , NF2, and APC. or other therapeutic 
genes; to make superior transgenic animals, for example superior livestock; or to produce gene 
products such as proteins, for example for protein production, in any number of host cells. Suitable 

35 gene products indude. but are not limited to, Rad51 , alpha-antitrypsin. casein, hormones. 
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antithrombin III. alpha glucosidase. collagen, proteases, viral vaccines, tissue plaminogen activator, 
monoclonal antibodies. Factors VIII. IX. and X. glutamic acid decarboxylase, hemoglobin, 
prostaglandin receptor, lactofemn. calf intestine alkaline phosphatase. CFTR. human protein C, 
porcine liver esterase, urokinase, and human serum albumin. 

Thus. In a preferred embodiment, the targeted sequence modification creates a sequence that has a 
biological activity or encodes a polypeptMe having a biological activity. In a preferred embodiment, the 
polypeptide is an enzyme with enzymatic activity. 

In addition to fixing or creating mutations involved in disease states, a preferred embodiment utilizes 
the methods of the present invention to create novel genes and gene products. Thus, fully or partially 
random alterations can be incorporated into genes to form novel genes and gene products, to produce 
rapidly and efficiently a number of new products which may then be screened, as will be appreciated 
by those in the art. 



In a preferred embodiment, the compositions and methods of the Invention are usefuf in site-directed 
mutagenesis techniques to create any number of specific or random changes at any number of sites 
or regions within a target sequence (either nucleic add or protein sequence), similar to traditional 
site-directed mutagenesis techniques such as cassette mutagenesis and PGR mutagenesis. Thus, for 
example, the techniques and compositions of the invention may be used to generate site specific 
variants in any number of systems, including E. coli, Bacillus. Archebacteria, Thermus. yeast 
(Sacc/jromyces and Pichia), insect cells (Spodoptera, Trichoplusla. Drosophila), Xenopus. rodent cell 
lines including CHO. NIH 3T3 and primate cell lines including COS, or human cells, including HT1080 
and BT474, which are traditfonally used to make variants. The techniques can be used to make 
specific changes, or random changes, at a parttoularsite or sites, within a particular region or regions 
of the sequence, or over the entire sequence. 

In this and other embodiments, suitable target sequences include nucleic add sequences encoding 
therapeutically or commercially relevant proteins, induding. but not limited to. enzymes (proteases, 
recombinases, lipases, kinases, carbohyd rases, isomerases, peptides tautomerases. nucleases ete.), 
hormones, receptors, transcription factors, growth factors, antibodies, cytokines, gipbin genes, 
immunosupppressive genes, tumor suppressors, oncogenes, complement-activating genes, milk 
proteins (casein, "-lactalbumin, B-lactoglobulin, whey proteins, semm albumin), immunoglobulins, urine 
proteins, mflk proteins, esterases, phannaceutfcal proteins and vaccines. 

In a preferred embodiment, the methods of the Invention are used to generate pools or libraries of 
variant nudeic add sequences, and cellular libraries containing the variant libraries. Thus, in this 
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embodiment, a plurality of targeting polynucleotides are used. The targeting polynucleotides each 
have at least one homology clamp that substantially corresponds to or is substantially complementary 
to the target sequence. Generally, the targeting polynucleotides are generated in pairs; that is, pairs 
are made of two single stranded targeting polynucleotides that are substantially complementary to 

5 each other (i.e. a Watson strand and a Crick strand). However, as will be appreciated by those in the 
art, less than a one to one ratio of Watson to Crick strands may be used; for example, an excess of 
one of the single stranded target polynucleotides (i.e. Watson) may be used. Preferably, sufficient 
numbers of each of Watson and Crick strands are used to allow the majority of the targeting 
polynucleotides to form double D-loops. which are prefen-ed over single D-loops. as outlined above. In 

10 addition, the pairs need not have perfect complementarity; for example, an excess of one of the single 
stranded target polynucleotides (i.e. Watson), which may or may not contain mismatches, may be 
paired to a large number of variant Crick strands, etc. Due to the random nature of the pairing, one or 
both of any particular, pair of single-stranded targeting polynucleotides may not contain any 
mismatches. However, generally, at least one of the strands will contain at least one mismatch. 

15 The plurality of pairs preferably comprise a pool or library of mismatches. The size of the library will 
depend on the number of residues to be mutagenized, as will be appreciated by those in the art 
Generally, a library in this Instance preferably comprises at least 40% different mismatches, with at 
least 30% mismatches being prefen-ed and at least 10% being particularly preferred. That is, the 
plurality of pairs comprise a pool of random and preferably degenerate mismatches over some regions 

20 or all of the entire targeting sequence. As outlined herein, Amismatches® include substitutions, 
insertions and deletions. Thus, for example, a pool of degenerate variant targeting polynucleotides 
covering some, or preferably all. possible mismatches over some region are generated, as outlined 
above, using techniques well known in the art. Preferably, but not required, the variant targeting 
polynucleotides each comprise only one or a few mismatches (less than 10). to allow complete 

25 multiple randomization, as outlined below. 

As will be appreciated by those in the art, the Introduction of a pool of variant targeting polynucleotides 
(in combination with recombinase) to a target sequence, either in vitro to an extrachromosomal 
sequence or in vivo to a chromosomal or extrachromosomal sequence, can result in a large number of 
homologous recombination reactions occuring over time. That is, any number of homologous 
30 recombination reactions can occur on a single target sequence, to generate a wide variety of single 
and multiple mismatches within a single target sequence, and a library of such variant target 
sequences, most of which will contain mismatches and be different from other members of the library. 
This thus works to generate a library of mismatches. 
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In a preferred embodiment, the variant targeting polynucleotides are made to a particular region or 
domain of a sequence (i.e. a nucleotide sequence that encodes a particular protein domain). For 
example, it may be desirable to generate a library of all possible variants of a binding domain of a 
protein, without affecting a different biologically functional domain, etc. Thus, the methods of the 
present invention find particular use in generating a large number of different variants within a 
particular region of a sequence, similar to cassette mutagenesis but not limited by sequence length. In 
addition, two or more regions may also be altered simultaneously using these techniques. Suitable 
domains include, but are not limited to, kinase domains, nucleotide-binding sites, DNA binding sites, 
signaling domains, structural domains, receptor binding domains, transcriptional activating regions, 
promoters, origins, active enzyme domains, dimerizing domains, leader sequences, terminators, 
localization signal domains, and. in immunoglobulin genes, the complementalty determining regions 
(CDR). Fc.V„andV,. 

In a preferred embodiment, the variant targeting polynucleotides are made to the entire target 
sequence. In this way, a large number of single and multiple mismatches may be made In an entire 
sequence. 

Thus for example, the methods of the invention may be used to create superior recombinant reporter 
genes such as /acZ. iuiciferase and green fluorescent protein (GPP); superior antibiotic and drug 
resistance genes; superior recombinase genes; superior recombinant vectors; and other superior 
recombinant genes and proteins, including peptides, immunoglobulins, vaccines or other proteins with 
therapeutic value. For example, targeting polynucleotides containing any number of alterations may 
be made to one or more functional or structural domains of a protein, and then the products of 
homologous recombination evaluated. 

Once made and administered to target cells, the target cells may be screened to identify a cell that 
contains the targeted sequence modification. This will be done in any number of ways, and will 
depend on the target gene and targeting polynucleotides, as will be appreciated by those in the art. 
The screen may be based on phenotypic, biochemical, genotypic, or other functional changes, 
depending on the target sequence. In an additional embodiment, as will be appreciated by those in 
the art, selectable mari<ers or mari<er sequences may be included in the targeting polynucleotides to 
facilitate later identification. 

In a preferred embodiment, kits containing the compositions of the Invention are provided. The kits 
include the compositions, particulariy those of libraries or pools of degenerate cssDNA probes, along 
with any number of reagents or buffers, including recombinases. buffers. ATP, etc. 
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The broad scope of this invention is best understood with reference to the following examples, which 
are not intended to limit the invention in any manner. All references cited herein are expressly 
incorporated by reference. 

pypppiMFNTA L EXAMPLES 

5 EXAMPLE 1 

Hnmnln nniis Tarnstino of recA- C nated Chemicallv-Modified PolYnUCleotitfe? In Cg»9 

Homologously targeted exogenous targeting polynucleotides specifically target human DNA 
sequences in intact nuclei of metabolicaliy active cells. RecA-coated complementary exogenous 
targeting polynucleotides were introduced into metabolicaliy active human cells encapsulated in 

1 0 agarose microbeads and permeabilized to permit entry of DNA/protein complexes using the Jackson- 
Cook method (Cook. P.R. fi984^ EMBO J. 3: 1837; Jackson and Cook (1985) EMBQA^: 919; 
Jackson and Cook (1985) EMBO J. 4: 913; Jackson and Cook (1986) J, Mffl, BiqL 192: 65; Jackson et 
al. (1988) .1 CeiLSci. 2Q: 365. whfch are incorporated herein by reference). These experiments were 
designed to specifically target homologous DNA sequences vnth recA protein in intact nuclei of 

IS metabolicaliy active human HEp-2 cells. 

Jackson and Cook prevfously demonstrated that the nuclear membranes of human or other cells may 
be permeabilized without loss of metabolic function when the cells are first encapsulated in a gel of 
agarose mfcrobeads. The agarose microbead coat contains the cell constituents and preserves native 
conformation of chromosomal DNA. while permitting diffusion of macromolecules into and out of the 

20 cell compartment. Wittig et al.(1991) Pmr. Natl Acad. Sci. (U.S.A.). gg: 2259. which is incorporated 
herein by reference, demonstrated that monoclonal antibodies directed against left-handed Z-DNA 
could be diffused into these agarose-embedded cells, and that the antibodies were specifically 
targeted to chromosomal sequences and conformations. In a similar manner, we incubated biotin- or 
FITC-labeled complementary DNA targeting polynucleotides coated with recAwrith agarose-coated cell 

25 nuclei and verified the conrect homologous targeting of the exogenous targeting polynucleotides to 
specific predetermined human DNA sequences in cell nuclei of metabolicaliy active cells. 

RecA-mediated homotogous gene targeting with complementary oligonucleotides in intact human cell 
' nuclei was verified directly by homologous targeting using targeting polynucleotides Uiat were 
btotinylated. These were subsequently labeled with a fluorescent reporter compound to verify 
30 homologous pairing at specific locations having the predetermined sequence(s). RecA-coated 

targeting polynucleotides for human chromosome 1 pericentrometric alpha-satellite DNA sequences 
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were specifically targeted to chromosome 1 centromere sequences in living human cell nuclei that 
were pernieabilized and suspended in agarose. 

In these experiments. recA^oated biotinylated exogenous targeting polynucleotides containing 
homologous sequences to human chromosome 1 alpha satellite DNA were inculated with human 
HEp-2 cells. The cells were emt)edded in agarose, then treated with standard buffers (according to 
Jaclcson and Cook. SExiL) to remove the cytoplasmic membrane and cytoplasm immediately before 
the addition of targeting polynucleotide coated with recA protein. 

The experiments were performed with the following results: 

First, in order to test protocols to be used in nuclear encapsulation, freshly trypsinized growing human 
HEp-2 tumor cells were suspended in complete DMEM encapsulated in a mixture of agarose (2.5%, 
Fisher-Biotech) and complete DIWEIVI media adapting the protocols of Nilsson et al.. 1983. so that the 
final agarose concentration was 0.5% (4 volumes cells in suspension with 1 volume 2.5% agarose), 
and the final cell concentration range was approximately 2.4 x 10' to 8 x 10*. The encapsulated cells 
in agarose "beads® were placed in petri dishes to which DMEIVI complete media was added and were 
allowed to grow for 24 hr in an incubator at 37X . 7% COj. At 24 hr. the cells were clearly growing 
and multiplying and thus were alive and metabolically active. 

An aliquot of agarose containing ceils (in beads in DMEM medium) was treated to remove the 
cytoplasmic membrane and cytoplasm by addition of ice-cold sterile PBS. New Buffer (Jackson et al. 
(1988)fiasiL; 130 mM KC1, 10 mM Na^HPO,. 1 mM MgC1j. 1 mM Na^TP. and 1 mM dithithreitol. pH 
7.4 ). New Buffer with 0.5% Triton-X 100. New Buffer with 0.2% BSA, then was centrifuged at low 
speed using protocols developed by Jackson and Cook. 1985 and 1986 op.cit.: witBg et al. (1989) J, 
CelLEiel J08: 755; Wittiget al. (1991) pp.cit.) who have shown that this treatment alknvs the nuclear 
membrane to remain morphologically intact The nuclei are metabolically active as shown by a DNA 
synthesis rate of 85 to 90% compared with that of untreated control cells. 

Cytoplasm was effectively removed by the above treatment, and the encapsulated nuclei were intact 
as demonstrated by their morphology and exclusion of 0.4% trypan blue. Nuclei in agarose were 
returned to the humidified COj incubator at 37X for 24 hr and remained metaboli(illy active. We 
observed that sterile mineral oil used in the emulsification process was difficult to remove entirely and 
interfered with the microscopto visualization of suspended nuclei. Therefore, the cell^garose 
suspension process was simplified. In subsequent experiments cells were gentty vortexed with melted 
(39'C) agarose, then the agarose-cell mixture was sterilely minced before New Buffer treatments. 
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This simpler process, eliminating the oil step, makes it easier to visualize the cells and chromosomes 
at the completion of reactions. 

After mincing of the agar and New Buffer treatments of the cells, the above protocols vi^ere used to 
homologously target endogenous DNA sequences in encapsulated nuclei as follows: 16.5 pi recA- 
coated (or non-recA-coated control) nick-translated DNA (labeled with btotin-14-dATP) targeting 
polynucleotide was prepared and bound under standard native recA protocols (SfiS U.S.S.N. 
07/755.462 and 07/910.791). Minced agarose fragments were centrifuged and New Buffer 
supematant removed. The fragments were resuspended in 1 X AC buffer in a 1.5-nil Eppendorf tube, 
ttien centrifuged for removal of the buffer (leaving an estimated 50 to 75 \t\ of buffer), and prepared 
targeting polynucleotide was mixed with the fragments of agarose-cpntaining nuclei. Reactions were 
incubated in a 37»C water bath for 2 to 4 hr. then washed, incubated in standard preblock solution, 
then in preblock supplement with 10 pg/ml FITC-avidin (Vector. DCS grade), and again washed. 
Experimental results were analyzed by placing a minute amount of a reaction with 3 to 4 pi antifade on 
a slide with a slide cover and viewing it by using the Zeiss CLSM-10 confocal laser scanning 
microscope (CLSM). Completed reactions were also stored refrigerated for later examination. 

In the first in yim experiment, metabolically active HEp-2 cells suspended in 1 x PBS were 
encapsulated in agarose by gentle vortexing, treated using New Buffer protocols, then incubated for 3 
hr 15 min with 100 ng of recA-coated targeting polynucleotide specifto for Chromosome 1 alpha- 
satellite DNA biotinylated with bio.14-dATP by nick translation (BRL. Nick Translation System) using 
pUC 1.77 plasmid DNA (a 1.77 kb long EcoRI fragment of human DNA In the vector pUC9; Cooke et 
al. (1979) IludSi£A2id&£SSLfi: 3177; Emmerich et al. (1989) Fxp , C^jl , Rgs. IM: 126). We obseived 
specifto' targeting by the alpha-satellite targeting polynucleotide to pericentromeric chromosome 1 
targets in Intact nuclei of metabolically active cells. The signals were essentially identical to those 
using the same targeting polynucleotide with methanol (or ethanol) fixed HEp-2 cell targets in 
suspension. Figure 1 shows specific targeting signals in several metabolically active cells from this 
experiment. 

In the second in vivo experinrient. cells suspended in incomplete DMEM media instead of 1 x PBS 
were encapsulated in agarose and treated with 62.5 ng of the same targeting polynucleotWe used in 
the first experiment described above and 62.5 ng of a freshly biotinylated targeting polynucleotide 
prepared under the same protocols. In this experiment, the minced agarose fragments were not 
resuspended in 1 x AC buffer before addition of targeting polynucleotide and some nuclei 
disintegrated, especially with subsequent centrifugation. The results show that in the nuclei that 
remained intact, the targeting polynucleotides coated with recA specifically targeted predetermined 
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human DNA targets. In contrast targeting polynucleotides in control reactions without recA did not 
target the human PNA sequences. 

Thus, the recA-coated targeting polynucleorides were targeted to the repetitive alpha satellite 
sequences of chromosome 1. This result showed DNA targeting in intact nuclei to specific human 
chromosome 1 sequences (data not shown). 

In the third experiment, ceils were suspended in 1 x PBS or in incomplete DMEM media before 
vortexing with agarose and were tested using 62.5 ng of targeting polynucleotide in reactions with and 
without recA protein. In addition, the reactions were divided in half and washed and FITC-avidin 
treated in either buffer adjusted to pH 7 or pH 7.4, Cells were incubated with the recA coated targeting 
polynucleotide for 3 hr 25 min. Live nuclei treated with targeting polynucleotide alone without recA 
showed no signals. In the recA-treated reactions, relatively weaker signals were observed in nuclei 
incubated in 1 x PBS, whereas very strong specific signals were present in nuclei that had been 
Incubated in incomplete DMEM. There was clearly significantly more signal present in nuclei that were 
washed and treated with FITC-avidin at pH 7.4 compared with nuclei incubated at pH 7.0. Figure 4 
shows nuclei that were treated with recA coated targeting poiy nucleotides and incubated at both pH 
7.4 and 7.0. 

In a fourth experiment, HEp-2 cells were embedded in agarose prepared with I x PBS. New Buffer 
treated, then treated with 100 ng of biotinylated targeting polynucleotide complementary to 
chromosome 1 alpha-satellite DNA. Controls in this experiment also included reactions without recA 
protein and additional control reactions supplemented with an identical amount of BSA protein to 
replace the recA protein. Additionally, cells were also embedded in agarose prepared with I x AC 
buffer. Examples of specific targeting to endogenous target sequences were recorded. 

In a fourth experiment, we directly determined if the embedded nuclei under the conditions used above 
were metabolically active. The nuclei in agarose were incubated with blo-21-rUTP in complete 
medium, then incubated for 2 days in the humidified COj atmosphere. After 2 days at 37'C. the cells 
were examined Bio-21-rUTP was Incorporated in RNA and incubated with FITC-streptavidin. FITC 
was specifically associated with nucleoli indicative of ribosomal RNA biosynthesis, thus directly 
showing metabolic activity in these human cells. Similar results were obtained usinjg DNA precursors 
to measure DNA synthesis. In this experiment It was clear that the majority of nuclei in the PBS 
agarose reaction had condensed chromosomes. There was nuclear activity in a number of these 
nuclei also, indicative of full metabolic viability, which was also shown in the AC buffer^treated cells. 

A fifth experiment was performed using, again, HEp-2 cells embedded in agarose. Final concentration 
of the cells in agarose was 3.7 x IO®/mI. The cells were suspended in 1 x PBS prior to combining with 
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agarose. The.final agarose concentration was 0.5%. There were two reactions, one in which recA 
was used to coat targeting polynucleotide, the second in which recA protein was replaced by BSA at 
the same protein concentration followed by New Buffer treatments to remove the cytoplasm. The 
nuclei in agarose were incubated for 3 hr with targeting polynucleotide, then processed for detection of 
correctly targeted polynucleotide using the protocols describe previously. FITC-avidin was used to 
visualize the biotinylated targeting polynucleotide at a concentration of 20 pg/ml. Results showed that 
cells with the recA-coated oomplementaiy targeting polynucleotide displayed specific signals in 25% or 
more of the intact nuclei. In contrast, the BSA-treated controls (without RecA) did not show any signal. 

Cells in agarose from this experiment were further incubated at 37'C in the COj incubator in complete 
medium. At 22 hr. these cells were metabolically active. Chromosomes were condensed, and a 
number of nuclei were in the process of dividing. In these experiments, a significant number of the 
cells incubated with recA-coated complementary targeting polynucleotides showed specific signal, 
whereas 0% of the cells incubated with targeting polynucleotide alone showed specific signal. 

In summary. recA-coated biotinylated targeting polynucleotides for human chromosome 1 alpha- 
satellite DNA were specifically targeted to human HEp-2 epithelial carcinoma chromosomal DNA in 
intact cell nuclei of metabolically active cells that had been suspended in agarose, then treated with 
buffers and recA-coated targeting polynucleotides under suitable reaction conditions (sues and 
U.S.S.N. 07/755.462: U.S.S.N. 07/755.462; and U.S.S.N. 07/520.321 . incorporated herein by 
reference). Specific binding by the recA-coated targeting polynucleocide to chromatin alpha-satellite 
DNA was observed only in the agarose embedded nuclei which were incubated with recA-coated 
targeting polynucleotides. Control nuclei incubated with targeting polynucleotides in the absence of 
lecA and/or with nonspecific protein exhibited no signal. 

Taroetina of Human dS3 Gene 

We performed recA-mediated homologous targeting of biotinylated targeting polynucleotides that were 
homologous to the human p53 tumor suppressor gene, and compared the results to targeting of alpha 
satellite DNA sequences in human chromosome 1. In these experiments, exponentially growing cells 
were trypsinized, washed, suspended in incomplete medium and encapsulated in asprose. The 
agarose was minced into pieces with a razor blade and the encapsulated cells were treated with New 
Buffer. A sample from each group was removed to verify that nuclei were intact 

Nuclei were washed in 1 x AC buffer and incubated with recA-coated comptementary single-stranded 
DNA oligonucleotides (i.e., exogenous targeting polynucleotides) for 3.5 hours at 37'C. The alpha 
satellite DNA targeting polynucleotides for chromosome 1 were previously described and were niclc- 
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translated with biotinylated deoxyribonucleotides (bio-14-dATP). The p53 tumor suppressor gene 
polynucleotide was obtained from Oncor (209 Perry Parkway. Gaithersburg, MD 20877) and is a 1.2 
kilobase cDNA fragment from a wild-^ype human p53 gene (Fields and Jang, (1990) Science 242 : 
1046; Miller at al. (1986) Nature m 783; Zakut-Houre et al. (1985) EMBOJ. 4: 1251 ). The 1.2 
5 kilobase human p53 DNA was nick-translated with biotinylated deoxyribonucleotides and yielded a 
populatfon of biotinylated targeting polynucleotides having a size range (about 100 to 600 nucleotides) 
similar to that obtained ft>r the human chromosome 1 alpha satellite targeting polynucleotides. The 
targeting polynucleotides were separately incubated with encapsulated cells. Following incubation 3 
washes of 1.75 x SSC were done, and sampled nuclei were verified as intact after the washing step. 

1 0 After washing, the targeted encapsulated cell nuclei were incubated in preblock and FITC-avidin was 
added to preblock buffer to a final concentration of 20 pg/ml for 15 minutes in the dark. The targeted 
encapsulated cell nuclei were washed sequentially in 4 x SSC. 4 x SSC with 0.1% Triton X-100, and 
then 4 x SSC. Samples of nuclei were again taken and used to verify that the targeted nuclei were 
metabolically active. Microscopic examination showed that metabolically active cells contained 

1 5 specific FITC-targeting polynucleotide: targeted endogenous sequence complexes (shown in Figure 
2). The p53 targeting polynucleotides were specifically targeted to human chromosome 17, the 
location of the endogenous human p53 gene sequences, indteating specific pairing of a targeting 
polynucleotide to a unique endogenous DNA target sequence. The human chromosome 1 alpha 
satellite DNA was also specifically targeted to ttie chromosome 1 pericentromeric satellite sequences. 

The experiments validated a highly specific DNA targeting technique for human or other cells as 
evklenced by homologous sequence targeting techniques in metabolically active cells. The targeting 
technique employs the unique properties of recA-nnediated DNA sequence targeting with single- 
stranded (complementary) short tai^eting polynucleotides. Native intact nuclei were incubated with 
labeled, heat-denatured targeting polynucleotides coated with recA protein. The DNA hybridized to 
the predetermined targeted homologous sequences. In these experiments, ttie targeting 
polynucleotides formed paired complexes with specific gene sequences wrthin metabolically active cell 
nuclei. This in vivo targeting by recA-mediated homologous taigeting polynucleotides shows tt>e 
targeting specificity and therapeutic potential for this new in yivQ mettiodotogy. Application of recA or 
other recombinase-mediated targeting of (complementary) ssDNA or denatured dsDNA targeting 
polynucleotides to predetermined endogenous DNA targets is important for gene entry, gene 
knockout, gene replacement, and gene mutation or correction. 

EXAMPLE ? 

Correcting a Mutant Gene to Produce a Functional Gene Product 
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Homologously targeted complementary DNA oligonucleotides were used to correct 11 bp insertion 
mutationsin vector genes and restore vector gene expression and vector protein function in microinjeate 
mammalian cells. 

Experimentswere designed to testwhether homologously targeted complementary 276-bp oligonudeotid 
targeting polynucleotides could correct an 11-bp insertion mutation in the lacZ gene of a mammaliiMMA 
vector, wrtiich encoded a nonfunctional $-galactosidase. so that a corrected lacZ gene encoded and 
expressed a functional enzyme. Functional enzyme (S^alactosidase) was detected by an X-gal assay 
that turns cells expressing a revertant (i.e.. corrected) lacZ gene a blue color. 
NIH3T3 cells microinjected with the mutant test vector bearing an 1 1 basepair insertion indliacZ coding 
sequence do not produce any detectable functional $-galactosidase enzyme. In contrast, cells 
microinjected with the wild type test vector do produce functional enzyme. 

We obtained the functional lac plasmid pMCIIacpA for use as a positive control for expression of $- 
galactosidase. pMCIIacXpA is the target test mutant plasmid (shown in Figure 3). It is identical to 
pMCIIacpA (shown in Figure4) but has a 1 1-bp Xbal linker insertional mutation. This plasmid does not 
express $-galactosidase activity in rouse NIH3T3 cells when introduced by electroporation. It does not 
produce blue color in the presence of X-gal indicative of $-galactosidase production following vector 
micro-injection. N^ative controls with mock or noninjected cells we also done. Using these conditions 
and NIH3T3 cells have no detectable background blue staining. 

The plasmid pMCIIacpA (8.4 kb) contains the strong polyoma virus promoter of transcription plus ATG 
placed in front of the lacZ gene. The polyadenylatfon signal from SV40 vims was placed in back of the 
lacZ gene. The plasmid vector was plB130 from IBI (New Haven. CT). The mutant vector pMCIIacpA 
has a 11-bp inserton in the Xbal site consisting of the inserted sequence CTCTAGACGCG (see Figure 
5). 

in several control micro-injection experiments using pMCIIacXpA weonsistently failed to detect any biB 
microinjected cells. In contrast, in various experiments nronitored early after microinjection apprimately 
9 to 13% of the NIH3T3 cells injected with pMCIIacpA DA expressed $-galactosidase as evidenced by 
their blue color. No cells microinjected with injection buffer alone omock injected were observed as blue 

We synthesized two 20-bp primers (PCR" and PCR$) for producing a 276-bp PCR product (see Figure 
5) from the wild-type lacZ sequence for use as targeting polynucleotWes. We chose this 276-bp fragint 
to span the 11 bp insertion mutation as a nonhonrwlogous sequence. The 276-bp DNA oligonucleotide 
was separated by gel electrophoresis and electroeluted from agarose, ethanol precipitated, and its 
concentration determined by absortance at 260 nm. The 276-bp fragment was 5* end-labeled with «P 
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and specifically D-looped with the pMcllacXpA or pMCIIacpA plasmid DNA using recA as shown by 
agarose gel electrophoresis. 

Experiments were deigned to test for $^alactoside production in cells microinjected with pMCIIacXpA 
vectors with targeting polynucleotide-target complexes using complementary 276-bp oligonucleotide 
targeting polynucleotide treated with recA. The 276-mer targstg polynucleotides in 1 X TE buffer were 
denatured by heating at lOO'C for 5 min and immediately quenched in an ice bath for 1 min. The ON A 
solution was collected at A'C by centrifugation. RecA-mediated targeting polynucleotide reactions 
containing a final volumeof 10 jjl were assembled using 1.0 jjI 10 x AC buffer. 1.5 pi 16 mM ATP(S. 3.8 
Ml dd H30, 1.2 Ml recA protein solution (13 pg/pl), and 2.5 pi of a 30 pg/ml stock of htaienatured 276-bp 
targeting polynucleotide. The recA protein was allowedotcoat the DNA for 10 min at 37'C. Next. 1.0 pi 
of 10 X AC buffer. 10 Ml of 0.2 M magnesium acetate. 1 .3 pi of pMCIIacXpA (1.0 pg/MD. and 6.7 pi of dd 
HjO was added to a final volume of 20 mI. Control reactions werperformed without added recA protein. 

NIH3T3 cells were capiMary needle microinjected with targeting polynucleotide-target DNA mixtures loade 
in glass pipettes freshly pulled into microneedles using a Sutter instruments microprocessor controlled 
apparatus. An ECET Eppendorf microinjection pump ad computerized micromanipulator were used for 
computer-assisted microinjection using an Olympus IMT-2 inverted microscope. Cells were carefully 
microinjected under controlled pressure and time. NIH3T3 cells injected with pWICIIacpA showed 
approximately 9% of the injected cells were blue. None (0%) of the celtajecled with plWICIIacXpA DNA 
in reactions containing the 271 bp oligonucleotide but without recA protein showed a blue color. In marHe 
contrast, approximately 3.6% of the cells microinjected with the recA-coated 271-bp targeting 
polynucleotide targeted to-the pMCI lacXpA target hybd were blue (Figure 6). indicating that the mutant 
pIMCIIacXpAgene can be targeted and corrected by the 271-bp oligonucleotide, whichas been targeted 
with recA-coated targeting polynuototides. In summary, these measurements show that the 11 bp Xba 
I insertionmutation can be corrected with the recA-mediated targeted corrected in JflJffi. but not with the 
271-bp oligonucleotide alone. Notdhat the 1q gjlu identification of 3T3 cells expressing $-galactosidase 
was perfomied following incubation with X-gal (5-bromo-4-chloro-3-indolyl-$- galactopyranoside) (Sigrpa) 
as described by Fische et al. (1 988) Nature 332: 853; Price et al. (1 987) PlQSLlisSL6SSSLSsaJii^ 
M: 156; Lim and Chae (1989) Bial££hDiflUSS 2: 576. 

EXAMPLE ■i 

30 Correcting a Human CFTR n isease Alteiy 

Homologously targeted complementary DNA)ligonucleotides were used to correct a naturally occumng 
3 bp deletion mutation in a human CFTR allele and restore expressfon of a functional CFTR protein In 
targeted mammalian cells. 
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A major goal of cystic fibrosis (CF) gene therapy is the correction of mutant portions of the CF 
transmembrane conductance regulator (CFTR) gene by replacement with wild-type DNA sequences to 
restore the normal CFTR protein and lortransport function. Targeting polynucleotides that were coated 
with recA protein were introduced into transformed CF airway epithelial cells, homozygous for bothedtts 
5 AF508 CFTR gene mutation, by either intranuclear microinjection, eledporation, or by transfection with 
a protein-DNA-lipid complex. 

Isolation and characterizaticn of the CFTR gene (Rommens et al. (1989) Science 245: 1059; Rlordan et 
al. (1989) Science 245 : 1066, incorporated herein by reference) has been crucial for understanding the 
biochemical mechanism(s) underlying CF pathology. The most common mutation associated with CF . 

10 a 3-base-pair, in-frame deletion eliminating a phenylalanine at amino acidqsition 508 (AF508) of CFTR, 
has been found in about 70% of all CF chroiosomes (Kerem et al. (1989) Science 245: 1073; Kerem et 
al. (iQQQ^ Proc. Natl. Acad. Sci. (U.S.A. )87: 8447). Correction ofAF508 and other CFTR DNA nutations 
lies at the basis of DNA gene therapy for CF disease. Elimination of the cAMP-dependent C1 ion 
transport defect associatal with CFTR gene mutations has been accomplished through the introduction 

15 of the transcribed portion of wild-type CFTR cDSIA into CF epithelial cells (Rich et al. (1990) Nature 342: 
358: Drumm et al. (1990) Cell §2: 1227). 

An immortalized CF tracheobronchial epithelial humarcell line. ECFTE290-, is homozygous fbrthe^F508 
mutation (Kunzelmann et al. (1 993) Am, J. P^?Pir. MqI. PioL 3:522). These cells are useful as 
targets for homologous recombination analysis, because they contain the same 3 basepair deletion in 

20 CFTR allele on all copies of chomosome 7. Replacement of the AF608 allele with wild-type CFTR DNA 
in indiisated only when homologous recombination has occurred. The 491 bp region of the CFTR gene 
spanning exon 1 1 and containing 3' and 5' flanking intron sequences was selected from sequence data 
published previously (Zielenski et al. (1991) Genomics 10: 214, incorporated herein by reference) and 
used as a targeting polynucleotide. Thd)NA fragment was PCR amplified In preparative quantities and 

25 then denatured for introduction into cells as recA-coated complementary ssDNA (or dsDNA). 
Exponentially growing cdls were transfected by intranuclear microinjection and were propagated on the 
same petri dishes In which they wee microinjected. Cells outside the microinjected area were removed 
by scraping with a rubber policeman. Exponentially growing cells were typsinized and washed before 
electroporation. Cells transfected with protein-DNA-lipldbomplexes were grown to approximately 70-80(> 

30 confluence before transfection. 

The 491 bp fragment was generated by PCR amplification from the T6/20 plasmid (Rommenst al. (1989) 
op.cit. . incorporated herein by reference) ad verified by restriction enzyme mapping and propagated as 
described previously. After digestion with EcoRI and Hindlll. a 860 bp insert was isolated following 
electrophoresis in 0.8% SeaPlaque agarose gel. The 860 bp fragm^ contained CFTR exon 10, as well 
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as 6* and 3' intron sequences, as defined by the restriction enzyme cleavagatBS (Zielenski et a!. (1991) 
op cit). A 50 ng aliquot of the fragment was amplified by PGR using primers CF1 and CF5 (Table 1 ) to 
generate a 491 bp fragment. The conditions for amplification were denaturation, 94**C for 1 annealing, 
63"C for 30 sec; extension. 72**C for 30 sec witha 4 sec/cycle increase in the extension time for 40 cycles 
The fragment size was confirmed by electrophoresis on a 1% agarose gel, then amplified in bulk In 20 
separate PGR amplifications, each containing 50 ng of target DNA. The 491 bp PGR products were 
extracted with phenol:chloroform:isoamyl alcohol (25:24:1) extraction and preciptad with ethanol. DNA 
precipitates were collected by centrifugabn in an Eppendorf microcentrifuge and resuspended at a final 
concentratton of 1 mg/mL The 491 bp fragment contained exon 10 (193 bp), as well as 5' (163 bp) and 
3* (135 bp) flanking Intron sequences, as defined by primers CF1 and GF5. 

The 491 nucleotide fragments were coa*d with recA protein using the reaction buffer of Gheng (Gheng, 
et al. (1988) J. Biol. Chem. 263:15110, incorporated herein by reference). Typically, the 491 bp DNA 
fragment (Spg) was denatured at 95*'G for 10 rm, then added to a 63 pi of coating buffer containing 200 
pg of recA protein, 4.8 mM ATP(S. and 1.7 pi reaction buffer (100 mM Tris-Ac, pH 7.5 at37X; 10 mM 
dithiothreitol; 500 mM NaOAc. 20 mM MgOAc, 50 percent glycerol) and Incubated for 10 min at 37X. 
Next, the MgOAc concentration was increased to a final concentration of about 22 mM by addition of 7 
pi of 200 mM MgOAc. Under these conditions, the 491 nucleotide fragment was coated witboA protein 
at a molar ratio of 3 bases per 1 recA molecule. After coating therfgments were immediately . placed on 
ice at 4'C until transfection (10 min to 1 hr). 

Microinjection, when used, was performed with an Eppendorf 5242 microinjection pump fitted to an 
Eppendorf 6170 micromanipulator using borosilicate pipetis (Brunswick. 1,2 CD x 1.9ID) fabricated into 
a microneedle with a Sutter Instruments (P-87) micropipette puller. The micropipettes were filled by 
capillary force from the opposite side of the needle. Approximately 100 pipettes were used for injecting 
4000 cells. Cells were injected with approximately 1,000-10.000 fragments per cell by Intranuclear 
injection witi 120 hPa for 0.1-0.3 s at a volume of 1-10 A/nucleus. Microinjected celts were viewed with 
an Olympus IMT-2 inverted microscope during the injection. The area of the petri dish containing ix^d 
cells was mariced with 2 to 5 mm diameter rings. Needienicroinjection was performed in celts grown oh 
10 separate 60 mm petri dishes. Cells were injected at room temperature in culture medium after two 
washes in phosphate buffered saline (PBS). After microinjection, noninjected cells in the culture were 
removed by scraping. Injected cells were grown at 37X in a humidified incubator at 7 days and then 
harvested for DNA and RNA. 

Electroporation experiments were performed using recA-coated 491 -mer ssDNA as described above. 
Approximately 1 x Itf exponentially growing cells were supended in 400|jl of coating buffer with 5 pg of 
recA coated-DNA. The cell suspension wa pre-incubated on ice for 10 min and electroporated at room 
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temperature with 400 V and 400mF In a BTX 300 electroporator (BTX Corporation, San Diego. CA). Afte 
electroporation. cells were incubted on ice for an additional 10 min, diluted in Eagle's minimal essential 
medium (MEM) supplemented with 10% fetal bovine serum (FBS) and tD^g/ml streptomycin. 100 U/ml 
penicillin (Cozens et ai. (iQQ:> prnn Natl Acad. Sci. fU.SA \89: 5171; Gruenert et al. (1988) PrPCt N9tl 
5 Arad Sci /U SA^ fig: 5951 ; Kunzelmann. (1 992) op.cit.) . and then seeded In T75 flasks. Under these 
conditions of elecroporation. approximately 30-50% of the cells survive. Cells we cultured for 507 days 
at 37**C and then harvested for DNA and RNA. 

Protein DNA-lipId complexes (liposomes) were prepared. Briefly, dioleoylphosphatidyl-ethanolamine 
(RdEtn. DOPE) was used for preparing liposomes by (Jring 4 pM solutions of the lipid under nitrogen at 

10 room temperature. The lipid film was rehydrated with 4 ml of 30 mM Tris-HCI buffer (pH 9), then 
sonicated for 15 minutes under an atmosphere or argon. The protein-DNA complex was prepared in 
polystyrene tubes by diluting 20 pg of recA-coated 491 -base DNA in 30 mM Tris-HCI. (pH 9) buffer. 
Gramicidin S protein (GmS) was also diluted with 30 mM Tris HC1 (pH 9) to a final concentration of 2 
mg/ml from a 20 mg/ml stcck solution prepared in dimethyl sulfoxide. The protein (40 pg) was added to 

1 5 the DNA and rapidly mixed. Next, 175 pi of the liposome solution (175 nmoles Qp\6) were added to the 
peptide DNA mixture. 

Genomic DNA was isolated and purified from cells as described in ManlatjgaA to test for homologous 
DNA recombination. Cellular DNA was firsPCR-amplified with primers CF1 and CF6 (Table 1 ), CF1 is 
within the region of homologydefined at the 5* end of the 491 bp CFTR fragment CF6 is outskJe the regi© 
20 of homology at the 3' end of this fragment 

The conditions for the PCR amplification were as follows: CF1/CF6; 684/687 bp fragment: primers, 0, 5 
pM: DNA. 1-2 pg; denaturation; 94**C for 1 min; annealing; 53"C for 45 s; extension; 72X for 90 s wit h 
a 4-s/cycle increase In extension time for 40 cycles; Mg*^ 1.5 mM. DNA fragments were separated by 
agarose electrophoresis and visualized by staining with ethidium bromide, then transfen-ed to Gene 
25 Screen Plus filters (DuPont). The DNA was then hybridized with the allele-specific normal CFTT^P-end- 
labeled DNA probe defined by oligo N as described by Cozens et al. (1992) QP.cjt; Kunzelmann (1992) 
QD.cit. . incorporated herein by reference. The presence of wild-type (WT) sequences was determined 
autoradlographlcally by hybridization with the radiolabeled DNA probe. 

Homologous recombination was verified ina second round of PCR DNA amplification using the 687/684 
30 bp fragment as a DNA template for amplification. The primers used in this allele-specific reaction were 
CFI and the oligo N or oligoAF. The size of the DNA fragments was 300 bp (oligo N) or 299 bp (oligAF). 
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The conditions for the reaction wereas follows: CF1/oligo N/AF; 300/299 bp fragment; primers, 0.5 (jM; 
DNA, 1-2 \jQ\ denaturation, 95°C for 45s; annealing. SVC for 30s; extension, 72X for 30 s with a 3-s/cycl 
increase in extension time for 40 cycles; Mg*^ 1.5 mM. In DNA from transfected ECFTE29o- cells, 
amplified with the CF1/oligo N primers, a PGR product was detected only if the wild-type CFTR sequenoe 
5 were present. Amplification with the CFI/oligcAF gives a PGR DNA productof DNA targets purified from 
transfected and nontransfected 3GFTE29o- cells but not for DNA targetssolated from control nomial ce& 
(16HBE140-). The presence of wild-type CFTR sequences in the amplified DNA fragments was also 
determined autoradiographically after hybridization with ^P-5'-end-labeled oiigo N as probe. 

Cytoplasmic RNA was isolated and denatured at 95"C for 2 min. then reveestranscribed using the DNA 
10 polymerase provided in a PGR RNA Gene Amp kit according to manufacturer's instructions (Perkin- 
Elmer/Cetus). First strand cDNA was amplified by using primer GF17 at the 5' end of exon 9 and the 
allele-specific oiigo N or oligo^F primers. The length of the PGR fragments is 322 bp (CF17/oligo ^bnd 
321 bp(GF17/oligoAF). 

The conditions for PGR amplification are CF17/oligo N/AF. 322/321 bp fragment; primers, 1 pM; 

1 5 denaturation. 94'C fori min; annealing, src for 30s; extension. 72'C for 20s with a 4-s/cycle increase 
in extension time for 40 cycles; Mg*^, o.8 mM. DNA fragments were visualized after electrophoresis on 
ethidium bromide-stained 1% agarose gels. In addition to the allele-specific PGR amplificatfon of 
first-strand cDNA. Southern hybridization was performed as described above. Fragments were 
transferred to Gene Screen Plus filtes then hybridized with allele-specific oiigo N probe under the same 

20 conditions used for the Southern analysis of the genomic DNA (Kurdmann et al. (1 992) oo.cit.: Cozens 
et al. (1992) pp.cit). The presence of wild-type CFTR RNA was confirmed by hybridization and 
autoradiography of RNA extracted from normal (16HBE140-) control DNA and in DtiA of transfected 
3GFTE29o-cells. 

Hybridization was perfbnned as described previously (Cozens et al. (1993te*£iL). DNA fragments were 
25 separated by agarose gel electrophoresis. DNA was denatured with 0.4 N NaOl^d 0.6 M NaC1 for 30 
min, then washed oncewith 1.6 M NaCI and 0.5 M Tris-HCI for 30 min. DNA was transferred to Gene 
Screen Plus membrane (NEN-DuPont) by capillary blot, again denatured wi1fi.4 N NaOH for 1 min. and 
then neutralized with 0.2 M Tris-HC1 (pH 7.0). DNA on membranewas prehybridized for 1 h at 37X in 
6 X SSG. 5 X Denhardfs solution. 1% SDS, containing 100 pg/ml of denatured salmon sperm DNA 
30 (Sigma). Oligonucleotide probes (oligdM or oiigo AF; 10 ng) were ^P-5'-end-labeied with 20 units of T4 
kinase and 40 pCi ^2p-(-ATP for 30 min at 37X. Unincorporated nucleotides were removed by 
centrifugatbn of the reaction mix through a minispin column (Worthington Biochemical Corp., Freehold, 
NJ). Hybridization was perfonned overnight at ZTC. Membranesvere washed twice for 5 min each tina 
in 2 X SSG at room temperature, twice fofiO min in 2 x SSG. 0.1% SDS at 45X, and once in 0.1 x SSG 
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for 30 min at room temperature. After washing, hybrids on membranes were analyzed 
autoradiographically by exposure to x-ray film. 

Analysis of 3CFTE290- DNA shows replacement of the endogenous mutar(I^F508) sequences with the 
exogenous normal fragment as evidenced by PCR amplification of genomic DNA and allele-specific 

5 Southern blot hybridization. PCR primers, one inside (CF1). and one outside (CF6) the region of 
homology (491 bp), were used to test whether the enplified DNA band was possibly due to amplification 
of any residual DNA fragment remaining in the cell after the transfection or by possible random DNA 
integration. A 687 bp fragmentconlains normal CFTR sequences while the 684 bp fragment is generate 
from AF508 CFTR DNA. To determine whether endogenous AF508 sequences were replaced with 

10 exogenous normal CFTR sequences, we analyzd aliquots of the 687 or 684 bp amplification fragments 
by Southern hybridization using ^^p^nd-labeled DNA probes specific for the AF508 or wild-type 
sequences (Table 1). h addition, the 687 bp fragment was PCR amplified by using the CF6 primer and 
a primer specific for either AF508 (oligo AF) or normal sequences (oiigo N). The second round of DNA 
amplification with the CF1/oIigo N or AF primer pair combination yields 300/299 bp fragnients. 

1 5 respectively. With the CF1/oligo N primer pair combination, a fragment will be detteri only if the mutant 
DNA has been replaced by normal sequences. Futter confirmation of honrralogous DNA recombination 
was tested by allele-specific Southern blot hybridization of the 300/299 bp fragments. 

Analysis of cytoplasmic RNA to detect normal exon 10 sequences in CFTR mRNA. verify that the 
homologous DNA recombination was legitimate and that normal CFTR mRNA is expressed in the 

20 cytoplasm. To test whether the PCR geerated DNA fragments were exclusively CFTR mRNA-derived. 
primers in exon 9 (CF17) and allele-specific (normal, oligo N oAF508. oligo AF) primers in exon 10. Ths 
amplification with priners CF17/N yields a 322 bp normal fragment only If transcription of homologously 
recombined DNA has occurred. A 321 bp DNA fragment would be generated if thftF508 mutation were 
present. Furthermore. Southem hybridization analysis with allele-specific ^P-end-labeled probes 

25 differentiated between normal and AF508 mutant sequences and were also used to confinm expression 
of wild-type CFTR mRNA In the cytoplasm. 

Homologous recombination between the targeting polynucleotide coprising WT CFTR DNA and AF508 
mutant cellular DNA allelic targets was evaluated by analysis or cellular DNA and RNA isolated from 
transfected and nontransfected 3CFTE29o-ce!l cultures. Nuclear genomic DNA and cytoplasmic RNA 
30 were Isolated 6 days after transfection, CFTR exon I sequences were amplifeby PCR. Oligonucleotide 
primers (Table 1) were used to amplify the region of CFTR DNA spanning exon 10. One PCR primer (C 
I) was within the region of homology defined by the 491 bp DNA fragment (sense primer), and the other 
(CF 6) was outside the homologous region in the 3* intron (antisense primer). This DNA amplification 
reaction produces a 687 bp fragment with nonnal human CFTR DNA or a 684 bp fragment if the DNA 
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contains the AF508 mutation, as shown in Fig, 7A. Southern hybridization wacarried out on the 687/681 
bp DNA fragments generated from .amplification of genomic DNA from cell cultures by microinjection or 
by transfection with the protein-DNA-lipId complex, shown in Fig. 7B. A probe consisting of ^P-end- 
labeled oligonucleotide DNA that hybridized only to DNA sequences generated from a normal exon 1 0 
was used. DNA from all microinjected and transfected cells produced specific hybrids as evidenced by 
autoradiographic hybridization. For cells microinjected with the 491 nucleotide fragmerff ife. 7B, lane 2). 
the present of normal exon 10 sequences indicated homologous replacement at least a frequency of $ 
2.5 X 10-^. This result indicates at least one correctly targeted homologous DNA replacement in about 
4000 microinjected nuclei. Other similar experiments using either electroporation or protein-DNA-lipid 
transfection to transfer the recA-coated 491 nucleotide CFTR DNA fragments also showed homologous 
recombination with the normal CFTR sequence in transfected CF cells. No hybridization was observed 
in control nontransfected (or mock-injected 3CFTE29o- cells). In each cell transfectedith nonnal CFTR 
DNA, analysis of the genomic DNA in a second round of allele-specific amplification of the 681/684 bp 
fragments with primers CFI/oligo N (Table 1) clearly showed the 300 bp fragment expected/hen wild-type 
CFTR sequences are present, as shown inFig. 8A. Fragments were detected for control 16HBE140- ceti 
(Fig. 8A, lane 2) and cells transfected with recA-coated DNA (Fig, 8A, lanes 5 ancDfi A 299 bp fragment 
(AF508-specific primer ends one base closer to the CF1 than the oligo N) was detected in DNA from 
nontransfected 3CFTE29o- cells amplified with CF1/olig<AF primers (Fig. 8A, lane 4). No fragment was 
detected in DNA from nontransfected 3CFTE29o- cells reamplified with the ICI /oligo N primers (Fig. 8A, 
lane 3). Allele-specific Southern blot hybridization of these fragments with the ^P-end-labeled oligo N 
probe resulted in autoradiographic hybridization signals from control nomial and transfected CF cells (Fig 
8B. lanes 1 , 4. and 5) but not from DNA of nontransfected CF cells amplified tkTii CF1 and oligo-N or -AF 
(Fig. 8B lanes 2 and 3). We tested whether any residual 491 nucleotide DNA fragments, which might 
remain in the cell after 6 days could act as a primer for the PCteaction, genomic 3CFTE29o- DNA was 
incubatedwith an equivalent number of recA-coated DNAragments (1CP-10*) introduced by microinjectioi 
(Fig. 9). One antisense primer contains the wild-type nonnal (N) sequence while the other contains the 
AF508 (AF) mutation. Amplification the CFI/AF primer combination gives a 299 bp fragments when 
the AF508 mutation is present. No DNA fragment product was detected when the CF1/N primer 
combination we ised with control nontransfected 3CFTE29o- DNA (Fig. 9, lane 2). However, when the 
CF1/AF primer ccmbination was used for DNA amplification in nontransfected 3CFTE29o- cells, a DNA 
product of the expected size (299 bp) was produced (Fig. 9. lane 1). These results Indicate that all 
residual 491 nucleotide DNA fragments which might remain in the cells after 6 days of culture were 
incapable of competing with the CF1 PGR primers in the PGR aniffication of the 687/684 bp fragments. 

labial 

PGR Primers and Oligonucleotides 
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OliQonuclectide DNA Strand PNA gequgnge 

CF1 S 5'-6CAGAGTACCTGAAACAGGA 

CF5 A 5'-CATTCACAGTAGCTTACCCA 

CP6 A 5'-CCACATATCACTATAT6CATGC 

S PC.R Primers arwj QHffftniideotides 

Olioonuclectide DNA Strand PNAgMM^nce 
CF17 S 5'-GAGGGATTTGGGGAATTATTTG 

OLITGON A S'-CACCAAAGATGATATTTTC 

OLIGO'F A 5*-AACACCAAGATATTTTCTT 

10 Notes: 

(1) CF1 and CF5 PGR primers were used to synthesize the 491 bp fragment used for the- 
targeting polynucleotide. 

(2) CF1 and CF6 PCR primers were used to amplify the 687/684 bp CFTR fragment 

(3) The CF17 primer is located at the 5' end of exon 9 and was used for amplification of first 
15 strand cDNA derived from CFTR mRNA. 

(4) Oligp N and Oligo AF are allele-specific probes and can also be used as allele-specific 
PCR primers for amplifying the 300/299 bp fragnwnts (DNA analysis) and the 322«21 bp fragments 
(RNA analysis). 

(5) Sense (S) and antisense (A) primers are designated under DNA Strand and indicate the 
20 sense of the strand relative to the transcribed direction (i.e.. the CFTR mRNA). 

The corrected CFTR DNA must also be expressed at the mRNA level for nonmal function to be 
restored. Therefore, cytoplasmic CFTR mRNA was analyzed for the presence of a normal CFTR RNA 
sequence in the AF508 region of exon 10. Cytoplasmic RNA was isolated from the cells, reverse- 
transcribed with DNA polymerase and PCR-amplified as first-strand cDNA. This amplification was 

25 performed vwth a PCR primer located in exon 9 (CF17, sense) and CFTR allele-specific PCR primer In 
exon 10 (oligo N or AF, antisense). The exon 10 primer contains the CF mutation site, and the 
resulting fragment is 322 bp in nomial DNA or 321 bp in DNA containing the AF508 mutation. 
Amplification of genomic DNA is eliminated by using primers that require amplificatipn across 
Intron/exon boundaries. Amplified cDNA generated from normal control 16HBE140- cells and 

30 experimentally trarisfected cells yielded DNA product fragments with the CF1 7/oligo N. whereas 

nontransfected 3CFTE29o- cells only showed a DNA fragment after amplification with the CF17/oligo 
AF primers but not wrfth the CF17/oligo N primers. Cells electroporated with wild-type 491-mer CFTR 
DNA showed the presence of wild-type CFTR nriRNA. In addition. protein-DNA-lipid-transfected 
3CFTE290- cell cultures also showed the presence of wild-type CFTR mRNA in cells transfected with 
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the recA-coated 491 nucleotide fragment. Southern hybridization of the 322/321 bp cDNA fragments 
with the ^^P-end-labeled N oligonucleotide DNA probe showed the specificity of the PGR amplification 
and produced specific autoradiographic hybridization signals from all cell cultures transfected with 
recA-coated 491 nucleotide targeting polynucleotide. No autoradiographic hybridization signals were 
5 detected in nontransfected 3CFTE29o- cells amplified with the CF17/oligo N or oligo AF primers. 
These analyses verify that the genomic DNA homologously recombined with the WT 491-mer DNA at 
the AF608 CFTR DNA locus resulting in RNA expressed and transported to the cytoplasm as wild- 
type CFTR mRNA. 

This evidence demonstrates that human CF AF508 epithelial cells CFTR DNA can homologously 
1 0 recombine with targeting polynucleotides comprising small fragments of WT CFTR DNA resulting in a 
con-ected genomic CFTR allele, and that a recA-coated targeting polynucleotide can be used in 
transfection reactions in cultured human cells, and that cystic fibrosis AF508 mutations can be 
con-ected in genome DNA resulting in the production of normal CFTR cytoplasmic mRNA. 

Taken together, the data provided indicates that 491 -mer ssDNA fragments can find their genomic 
15 homologues when coated with recA protein and efficiently produce homologously targeted intact cells 
having a corrected gene sequence. Analysis of CFTR in cytoplasmic RNA and genomic DNA by 
allele-specific polymerase chain reaction (PCR) amplification and Southern hybridization indicated 
wild-type CFTR DNA sequences were introduced at the appropriate nuclear genomic DNA locus and 
was expressed as CFTR mRNA in transfected cell cultures. Thus, in human CF airway epithelial cells, 
20 491 nucleotide cytoplasmic DNA fragments can target and replace the homologous region of CFTR 
DNA containing a 3 bp AF508 deletion. 

Correctly targeted homologous recombination was detected in one out of one microinjection 
experiment vwth recA-coated targeting polynucleotide, two of two different electroporatidn experiments 
with recA-coated targeting polynucleotide, and one of one lipid-DNA-protein complex transfection 
25 experiment with recA-coated targeting polynucleotide. Taken together, these 4 separate experiments 
strongly indicate that homologous recombination with recA-coated targeting polynucleotides (491-mer 
CFTR DNA) is feasible for treatment of human genetic diseases, and can be performed successfully 
by using various methods for delivering the targeting polynucleotide-recombinase complex. 

EXAMPLE 4 

30 HomoloQQus recombina tion in procarvotic cells 

In order to study the biological consequences of the cssDNA probe:target hybrid DNA structures in 
cells, we developed a simple and elegant assay to rapidly screen for in vivo homologous 
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recombination events in Escherichia coli. The principle of this assay is to screen for the 
recombinogenocity of hybrid stnjctures formed between a dsDNA plasmid target carrying a 59 bp 
deletion in the lacZ gene (pRD.59) and cssDNA probes from the wild type lacZ (IP290) gene by 
introducing these pre-formed protein-free hybrids Into £. coli by electroporation (Figure 10). 
5 Homologous recombination frequencies are scored by plating transformed cultures in the presence of 
a chromogenic substrate (X-gal) so that recombinant bacterial cells (carrying plasmids that encode a 
wild type lacZ gene resulting from homologous recombination) appear blue. 

DMA plasmids and DNA probes: The plasmid pRD.59 was made from the 2,9 kb cloning vector 
pBluescript IISK(-) (pRD.O) (Stratagene). The pRD.O DNA was linearized at a unique EcoRI site in the 

10 polylinker region of the lacZ gene and digested with mung bean nuclease (Boehringer-Mannheim). 
The plasmids were then ligated and transformed into the RecA(-) E coli host XL1-Blue (Stratagene). 
The resulting alpha peptide mutant clones were screened for lack of alpha-complementation of 
galactosidase activity, which results in white colonies when grown on plates containing X-gal and 
IPTG (Sambrook et al., 1989). Plasmid DNAs recovered from white colonies by a mini-prep procedure 

1 5 (Qiagen) lacked the unique EcoRI site, as well as the Xhol and Xbal sites. These mutant clones were 
then sequenced using Sanger dideoxy sequencing methods (Sequenase Kit verston 2, USB) to 
determine the length of the deletion. Several clones containing delettons ranging from 4 bp to 967 bp 
were sequenced and named pRD for plasmids with an EcoRI deletion. The cloning vector pBluescript 
IISK(-) was named pRD.O because it does not contain any deletions. 

20 All samples of the plasmid DNA were then prepared by the Qiagen Maxi-Prep (Qiagen) procedure 
from strain of XL1-Blue (Stratagene) containing the plasmids. The cultures were grown on Luria-Broth 
(LB) media (Sambrook, et al., 1989) containing 100 |jg/ml ampicillin. Recovered plasmids were more 
than 90% negatively supercoiled Form I DNA as judged by agarose gel electrophoresis. 

Biotinylated cssDNA probes were made from a fragment of the normal pBluescript IISK(-) plasmid. 

25 The plasmid DNA was linearized with Bgll and run on a 1% agarose gel in IX TAE. After ethidium 
bromide staining, the 1.6 kB fragment band was excised from the gel and purified using the Qiaex II 
gel purification method (Qiagen). This 1.6 kb fragment was diluted 1:20 and then used as a template 
for PCR, The PGR reaction mixture contained biotin-14-dATP (GIBCO-BRL) in order to synthesize 
IP290. a 290 bp biotinylated cssDNA probe homologous to the LacZ region of pRD.O. In addition. 

30 pRD.59 was linearized with Bgll and the 1 .55 kb fragment was purified in the same manner as the 

pRD.O 1.6 kb fragment. Using the same primers that were used to synthesize IP290. the pRD.59 1.55 
kb fragment was used as a template for PCR to synthesize DP231. a 231 bp biotinylated cssDNA 
probe homologous to the LacZ region of pRD.59. It is missing the 59 base pair sequence that flanks 
the EcoRI site. Biotinylated cssDNA probe CP443 was made In the same manner except that pRD.O 
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was linearized with Oral and different primers were used. CP443 is completely homologous to pRD.O 
andpRD.59 in a region outside of the LacZ gene 

RepA mg^iiated cssDNA targeting reactions and purification of nrn b e:taraf>t hna hy hp^^" Before 
targeting, biotinylated cssDNA probes (70 ng) were denatured by heat at 98'C for 10 minutes, cooled 
5 immediately in an ice-water bath, and then oentrifuged at 4'C for 10 seconds to recover all liquids. 
Reactions without cssDNA probe contained equivalent volumes of water. The denatured cssDNA 
probes were then coated with RecA protein (Boehringer-Mannheim) in Tris-acetate reaction buffer 
(Cheng et al.. 1988; 10 mM Tris-acetate (pH 7.5). 1 mM dithiothreitol. 50 mM sodium acetate. 2 mM 
magnesium acetate, 5% (v/v) glycerol) with 2.43 mM ATPS for 1 5 minutes at 37'C in a 10 pi volume. 
1 0 Reactions without the RecA protein contained equivalent volumes of RecA storage buffer (20 mM Tris- 
HCI. pH 7.5. 0.1 mM EDTA. 1 mM DTT, and 20% glycerol). 

The RecA mediated targeting reacBons were performed by adding 1- 4 ^jg of the appropriate plasmid 
DNA in an aqueous solution containing 22 mM magnesium acetate, bringing the final magnesium 
concentration to 11 mM and the final reaction volume to 20 jjI. The reaction was incubated for another 
IS 60 minutes at 37°C. 

At the end of the targeting reaction. SDS was added to a final concentration of 1.2% to deproteinize 
the complexes. If further enzymatic treatments were necessary on the targeted complexes. 3 volumes 
of phenol:choloform.isoamyl alcohol (Sigma), shaken on a Multi-Tube Vortexer (VWR) for 4 minutes at 
4°C. and centrifuged for 5 minutes at 4°C. The supematant was recovered, placed in a new tube, and 
20 extracted with 1 volume of chloroform. The mixture was shaken for 2 minutes at 4'C, and centrifuged 
for 6 minutes at 4*C. The supematant was recovered, containing the purified targeted complexes. 

Detection of prpbeiterget PNA hyhrWr?- After deproteinization, the complexes were run for 20 hours at 

30 V on a 20 cm by 25 cm 1% agarose TAE gel (GIBCO-BRL) at room temperature. The gels were 

visualized by staining in 1 pg/ml ethidium bromide and then cut down to 11 cm by 14 cm before they 

25 were soaked in 1 0X SSC and transferred to positively charged Tropilon membranes (Tropix) by 

Southern blotting method under non-denaturing conditions. Blots were then UV cross-linked 
(Strataiinker). 

Biotinylated cssDNA probes and probe:target hybrids were detected using the Southern-Light System 
(Tropix). The nylon bound DNA btots were treated with avidin conjugated to alkaline phosphatase. 
30 followed by the chemiluminescent substrate. CDP-Star (Tropix). in conditions described by the 

manufacturer. Blots were exposed to X-ray film (Kodak) for varying times (1 minute to 8 minutes) and 
developed. 
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Rftrtro poration of probeiaraet DNA hybrids into meta hnlicallv active E. co// ceils: After purification of 
targeted complexes. 40 pi of electro-competent RecA(+) and/or RecA(-) E coli (Dower et al., 1988) 
was added to 30-200 ng of the targeted complexes in a chilled microfuge tube. The RecA(+) cells 
were BB4 (Stratagene) and the RecA(-) cells were XL1-Blue (Stratagene). The mixture was incubated 
5 on ice for 1 minute. This mixture was then transferred to a chilled 0.1 cm gap electroporation cuvette 
(Bio-Rad) and electroporated under the following conditions: 1 .3 V. 200 ohms. 25 pF on a Bio-Rad 
Gene Pulser. The time constant ranged from 4,5 - 4.7 msec. Immediately aftenwards. 1 mL of SOC 
media (Sambrook. et al.. 1989) was added and the mixture was transferred into a 10 mL culture tube. 
After all the electroporation groups were finished, the tubes were shaken at 225 rpm at 37*C for 1 
1 0 hour. Appropriate amounts were plated onto LB agar plates which already contained 100 pg/ml 

ampicillin (Sigma). 20 pg/ml X^al (GIBCO-BRL). and 48 pg/ml IPTG (GIBCO-BRL). and incubated at 
37"C overnight 

Rrreenino for homologous DNA recombination in LacZ: After overnight Incubation (approximately 16 
hrs.), colonies were counted to determine electroporation efficiency and scored for any blue colonies 

1 5 in plates. Blue colonies were scored if they resembled blue colonies displayed by the control plasmid 
pBluescript II SK(-). which is able to undergo alpha-complementation and produce blue colonies. Blue 
colonies were serially propagated on AIX plates at least twice to confirm recombinant stability as 
monitored by consistency of color. When the colonial streaks displayed a homogeneous color, 
plasmlds were isolated by a mini-prep and digested with EcoRl. Xhol. and Pvull to confirm 

20 homologous recombination of the plasmid at the DNA level. EcoRI and Xhol sites are restored if 
homologous recombination has occun^d. Pvull restriction sites which flank the LacZ region contains 
the 59- base pair deletion; if recombination has occun-ed, this fragment will be significantly larger than 
fragments lacking the 59 base pairs after digestion with Pvull. 

RfirA mediated cssDNA taroetlna to negatively suoercoile d dsDNA substrates contaihinq deletions: 
25 Stable probe:target hybrids formed in the RecA mediated targeting reaction between the biotinylated 
RecA coated cssDNA probes IP290 and the negatively supercolled Fomi 1 dsDNA targets pRD.59. 
which contain a 59 base pair deletion respective to the cssDNA probe, were monitored by 
chemiluminescent detection of biotinylated hybrids (Figure 11). The presence of a sizable region of 
non-homologous nucleotide sequences (59 bp) in the cssDNA probe IP290 does not significantly 
30 affect the ability of the RecA coated cssDNA probe 1P2Q0 to form stable probe:target hybrids with 
.pRD.59 in comparison to completely homologous dsDNA pRD.O (Figure 11, lane 3 and 6). In each 
reaction, under these conditions, the presence of the RecA protein was absolutely required for hybrid 
detection (Figure 1 1 . lane 2 and 5). 
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Probe;target DNA hybrids formed when the RecA coated biotinylated cssDNA probe IP290 is 
hybridized to the completely homologous dsDNA target pRD.6 differ from probertarget hybrids formed 
when the same cssDNA probe is hybridized to the dsDNA target pRD.59 containing a 59 base pair 
deletion with respect to IP290. While more than 90% of both the dsDNA targets exist as negatively 
5 supercoiled Form I DNA. when hybrids formed between pRD.O and RecA coated cssDNA probe IP290 
are deproteinized. the probertarget hybrids migrate to a position that is similar to the migration of Form 
II. relaxed circular dsDNA. in 1% agarose gel in 1X TAE buffer (Figure 11. lane 3 and 6). and there 
was no evidence of probertarget hybrids that co-migrate to Fomi I DNA on a 1% agarose gel (Figure 
11. lane 3). This probertarget hybrid is referred to as a relaxed Form I* hybrid or a ri* hybrid because 
1 0 the hybrid has the same elelctrophoretic mobility as relaxed circular DNA. In contrast, when the RecA 
coated cssDNA probe IP290 was hybridized to the dsDNA target pRD.59. which as a 59 bp deletion 
with respect to the probe, two different probertarget hybrids were apparent. One has an 
electrophoretic mobility comparable to that of Form I supercoiled dsDNA (Figure 1 1 . lane 6) while the 
other migrates to the same position as the ri* hybrid. These two forms appear to be present in equal 
15 amounts as indicated by the signal from chemiluminescent DNA detection. This probertarget hybrid is 
referred to as a Fomi r hybrid or I* hybrid, differentiating it from Form I DNA because it is targeted 
with RecA coated cssDNA probe. In order to exclude the possibility that it is the structure of the 
dsDNA target that creates the fonnation of two major probertarget hybrid products, the cssDNA probe 
DP231 was hybridized to pRD.59. The cssDNA probe DP231 is completely horriologous to the mutant 
region of the LacZ gene in pRD.59. The only probertarget hybrid detected has the electrophoretic 
mobility of Fomi II dsDNA, the r\' hybrid (Figure 1 1 , lane 8). In addition, when the cssDNA piobe 
CP443, which is completely homologous to a region outside of the 59 base pair detetion. was 
hybridized to pRD.59. only the rl* hybrid product was detected (Figure 1 1 , lane 1 0). Thus, when the 
RecA coated cssDNA probes are targeted to homologous sequences, only the ri* hybrid is present, 
>5 but when it is targeted to homologous sequences with relatively short heterologies, two fbnns of 
hybrids, rl* and I* hybrids are fomned in apparently equivalent amounts. 

Rewmbinoqenicity of pmhe-ffinet PNA hvWj;- To study the biological consequences of the 
probertarget hybrid structures . we assayed for putative homologous recombination events in £ co// by 
the electroporation assay (described in Figure 10). 

0 Figure 12 shows the percentage of potential recombinant blue colonies fomied when IP290 
proberpRD.59 target hybrids wfere electroporated into RecA+ and RecA- cells. Blue colonies only 
arose when deproteinized hybrids formed with pRD.59 and cssDNA probe IP290 are introduced into 
RecA+ E. coil cells. Control experiments perfomied with cssDNA probes homologous to the mutant 
LacZ region of pRD.59 (DP231) and homologous to a region outside of the LacZ gene (CP443) did not 
yield any blue colonies. (Figure 12). In addition, when all of these hybrids were transformed into 
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RecA(-) hosts, no blue colonies were produced from any type of hybrid, indicating the the 
recombinogenic effect is also dependent on endogenous RecA protein produced in the cell. Thus only 
the cssDNA probe containing the 59 base pair correction produces recombinogenic clones in bacterial 
host cells that are RecA(+). 

When potential homologous recombinant blue colonies were propagated by streaking out on AIX 
plates, only 50% of the colonies were blue. When a blue colony from the first streak was propagated 
by recombinant streaking, the colonies remained stably blue over several generations. If piasmid 
DNA v\«s isolated from third generation propagations and then transfbnnfied into RecA(-) cells, this 
resulted in blue cotonies which remained stably blue on continued propagation. Of the potential 
recombinants that have been rigorously screened by restriction enzyme digestion, at least 67% of the 
plasmids recovered from blue colonies are true homologous recombinants. This was deterimined by 
the restoration of EcoRI and Xhol restriction sites, and a Pvull digest of the DNA shows a fragment 
that migrates at a higher molecular weight than fragments which are missing the 59 base pair region. 

This is consistent with the view that only one strand is exchanged in these hybrids to form 
heteroduplex targets and that upon replication one strand will produce a piasmid that contains the 59 
base pair con«ction while the other does produces the mutant pRD59 piasmid. 

As outlined in Example 5, we show that the recombinogenlcity wi8i probe:target hybrids of cssDNA 
probes and dsDNA targets containing deletions is associated with the re-annealing of regions of 
cssDNA probe that can not hybridize to dsDNA targets, by creating internal homotogy clamps (Figure 
13). 

EXAMPLES 

Enhanced homologous rpmmbination with targets c nntalnino insertions and deletion? gpntaininq 
Internal hom ology clamps 

An in vitro DNA hybridization reaction that allows the pairing of RecA-coated complementary single- 
stranded (CSS) DNA probes to homologous regions in linear duplex target DNA has been used to study 
the effects of heterologies within the regions of homotogy between the probes, and tirget DNA. In 
cssDNA targeting reactions catalysed by RecA protein. cssDNA probes are kinetically trapped within 
the duplex DNA target at honrwiogous sites and form a highly stable four-stranded DNA hybrid 
structure. After removal of RecA protein, this homologous recombination reaction can be trapped at 
the DNA pairing step. The effect of defined heterologous insertions or deletions in linear duplex 
targets on the pairing of RecA-coated cssDNA probes was determined for heterologies ranging from 4 
to 967 bp. We demonstrate that small deletions and insertions up to 10% of the total cssDNA probe 
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lengths, ranging from 215 -1246 bp do not significantly affect DNA pairing. Furthernrwre both 
insertions and deletions of the same size in the cssDNA probe have the same effect on DNA pairing. 
IVIoreover, large deletions, up to 967 bp, can be tolerated in deproteinized hybrids form with a RecA- 
coated 1.2 kb cssDNA probe. The stability of these hybrids with heterologous sequences within the 
5 homologous paired region is due to the re-annealing of the cssDNA probes to each other within the 
DNA hybrid producing a novel four-stranded heteroduplex DNA intermediate that contains a novel 
internal base-paired homology clamp. 

Preparation of ds target substrates- A series of plasmid DNA targets with defined deletions were 
constructed by linearization of the plasmid vector pBluescript IISK(-) (Stratagene) at a unique EooRI 

10 restriction site in the polylinker regton following digestbn with mung bean exonuclease {Boehringer- 
Mannheim), DNA ligation, and subsequent transformation Into XLI-Blue E coli (Stratagene) by 
standard methods. The resulting clones were sequenced using Sanger dideoxy sequencing methods 
(Sequenase Kit version 2, USB) to determine the extent of deletion. A series of plasmids with 
deletions ranging from 4 to 967 bp were prepared and named for the extent of size of the deletion 

15 (see Figure 15), The size of the parent plasmid, pBluescript IISK(-). refen-ed to as pRD.O in this study, 
is 2960 bp. Plasmid DNA was prepared by a modified alkaline lysis procedure with anion-exchange 
purification (Qiagen). The DNA was further purified by phenol-chlorofomi-isbamyl alcohol extraction 
(24:25:1) (SIGMA) and ethanol precipitation, and then resuspended in TE (10 mM Tris HCI. pH7.5, 1 
mM EDTA).buffer. These preparations contained greater than 90% Fomi I DNA. Preparations of 
20 linearized Form III DNA were made by digestion of the plasmids at a unique Seal restriction site 

outside the polylinker, followed by phenol-chloroform-isoamyl alcohol extraction (SIGMA), chloroform 
extraction, ethanol precipitation, and resuspension in TE buffer. 

Preparation of cssDNA prnhpy Biotin-labeled probes homologous to pRD.O were synthesized by 
PCR with Incorporation of biotin-14-dATP using previously described methods where the molar ratio of 

25 unlabelled dATP to biotin-labelled dATP was 3:1 (Griffin & Griffin, 1995). Primer pairs flanking the 
polylinker region of pRD.O or analogous plasmids with a deletion were chosen to produce PCR 
fragments which span the deletion in the target plasimlds. In addition a control PCR fragment (CP443) 
primer pair flanking sequences outside the polylinker was selected for production of a probe 
homologous to all clones in the plasmid series. The oligonucleotide products were, purified by 

30 membrane ultrafiltration using Microcon 100 filters (Amicon). 

Taroetina of cssDNA probes to dsDN A targets in solution: cssDNA targeting was performed 
essentially as described in Sena & Zariing (1 993), with the exception that cssDNA probes were 
synthesized and labeled by PCR in the presence of biotin-14-dATP (GIBCO/BRL), as indicated above. 
In each reaction 70 ng of biotin-labelled RecA-coated cssDNA probe was reacted with 1 pg of Sca1- 
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digested target DNA, resulting in cssDNA probertarget ratios of 1:1 (for 215 bp cssDNA probes) to 1:5 
(for 1246 bp cssDNA probes). The products of the targeting reactions were deproteinized by 
treatment with SDS (1.2% final concentration) or phenol:chloroform: isoanDyl alcohol (24:25:1) and 
chlorofomn extraction and then separated by electrophoresis on 1% agarose gels in TAE buffer. The 
5 gels were run at 2V/cm at room temperature in the absence of ethidium bromide for 20 hours. After 
electrophoresis, gels were stained in 1 \igfrr\\ ethidium bromide for 15 min. The DNA was transferred 
under non-denaturing conditions (10X SSC) onto nylon membranes (Tropix) and cross-linked using a 
Stratalinl<er (Stratagene) on the auto-crosslink setting. The extents of biotinylated 
cssDNAprobertarget hybrid formation was measured by quantitating the amount of biotin-labeled 
10 probe DNA that co-migrates with dsDNA target DNA following electrophoretic separation of these 
biotinylated probeitarget hybrid products from free unhybridized probe DNA. The amount of 
biotinylated probe DNA In probe:target complexes was visualized with a chemiluminescent substrate 
conjugated to streptavidin (CDP-STAR) (Tropix) after exposure to XAR-5 film (Kodak). The levels of 
exposure were analyzed by densitometry and quantitated using the software package. NIH Image. 

1 5 In each case the relative level of hybrid formation with heterologous targets was expressed as a 
percentage of the level of hybrid formation of standardized reactions with a completely homologous 
probe and target These values were normalized to the level of hybrid formation that occured with 
control probe CP443 which hybridizes to all of the plasmid targets in a region away from the 
heterology. The data generally represent averages of at least three separate measurements from 

20 three Independent targeting reactions. 

Nomenclature and Assay for RecA-mediated pairing of cssDNA prob es to dsDNA targets.: To 
investigate the effects of heterologous insertions and deletions on homologous pairing of cssDNA 
probes to double-stranded linear plasmid DNA, we employed a modification of an in vitro DNA 
targeting assay described in Sena and Zariing (1993). The target DNAs used in this study are a series 

25 of plasmid DNA constnjcts that contain defined deletions at the unique EcoRI site in pRD.O 

(pbluescriptllSK(+), Stratagene Figure 14A). Plasmid targets (pRD.4 - pRD.967) are named for the 
size of deletion in bp at the EcoRI site. CssDNA probes were made and labelled with biotin-14-dATP 
by PGR using primers which symetrically flank the deleted region of plasmids in the pRD series. 
CssDNA probes made from pRD.O that were targeted to plasmids containing deletions are called 

30 insertion probes and named for the length of the probe in bp. For example. IP290 is a 290 bp cssDNA 
probe that contains an insertion with respect to a target containing a deletion, but is completely 
homologous to pRD.O. A cssDNA probe made from pRD.59 and targeted to pRD.O is called DP231. 
since it contains a deletion with respect to pRD.O. but is completely homologous to pRD.59. 
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After the hybridization of RecA-coated cssDNA probes with dsDNA targets, the reactions products 
were separated by agarose gel electrophoresis. The extent of fomiatlon of stable deproteinized 
cssDNA probe:target hybrid was measured by the quantitation of the amount of biotinylated cssDNA 
probes that co-migrated with the dsDNA targets. In each case the level of probe:target formation 
between a totally homologous probe and target was normalized to 100%. Previous studies have 
shown that efficient cssDNA targeting is completely dependent on RecA protein, the nucleotide co- 
fector. specific to homologous DNA targets and that fbmiation of deproteinized stable probe:target 
hybrids also requires both cssDNA strands (Sena and Zarling. 1993. R6vet et al. 1993). Furthermore 
we targeted Seal-digested pRD.O with two synthetic RecA-coated 121-mer cssDNA oligonucleotides 
homologous to the region symetrically spanning the EcoR1 site in pRD.O and demonstrated that botti 
cssDNA strands are required for stable hybrid fomiation with linearized pRD.O targets (data not 
shown). 

Sta|?lg cssDNA probe:tarqet hybrids are formed In linpar d sDNA taroats with deletions at internal stes 
To determine if a target DNA deletion affects the reaction kinetics of RecA-mediated cssDNA pairing 
to linear DNA targets, we measured the relative amount of deproteinized cssDNA probe:target hybrid 
formation over time in reactions using c^DNA probe IP290 with either a completely homologous linear 
target, pRD.O or a target carrying a 59 bp deletion. pRD.59. Probe IP290 symetrically spans the 59 bp 
deletion in pRD.59. Figure 15B shows that in steady state hybrid reactions, the maximum level of 
stable hybrid formation when RecA-coated IP290 is targeted to pRD.59 is 62% of the steady state 
level obtained with the fully homologous target pRD.O. Furthennore steady state levels of hybrid 
fomiation occurs within 45 minutes with fully homologous pRD.O targets, but requires 2 hours for 
pRD.59 targets. Thus, in all subsequent experiments RecA-coated probes were hybridized for 2 hours 
at 37X with the linear target DNAs. 

The effect of duplex DNA target deletions on the formation of deproteinized cssDNA probe: target 
hybrids was determined by hybridizing RecA coated cssDNA probes which span the deleted regions in 
pRD.4 - pRD.298 on DNA targets linearized by Seal (Figure 15A). The relative amount of bfotinylated 
probe:target hybrids formed with each of these targets was compared with the amount of cssDNA 
probe target hybrids formed with pRD.O. These values were nonnalized to the level of hybrid 
formation obtained with the control probe. CP443. which is homologous to a region avray from the 
deleted regions or pRD.O and thus, is completely homologous to all target DNA substrates used in this 
study. 

Our initial studies tested the effect of small target deletions on targeting efficiency using either cssDNA 
probes IP527 or IP407 (Figure 15B and 15C). Because the 5'- and 3'-termini of both of these cssDNA 
probes are approximately symmetric with respect to the 4 to 59 bp deletions, the differences in the 
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efficiency of hybrid formation are not due to the effects of the position of the deletion with respect to 
the probe in relation to probe ends. As expected, in experiments using either the IP527 or IP407 we 
observed a decrease In the level of hybrid fomiation with an increase deletion size. These data also 
show that relatively small deletions (< 25 bp) in the target do not dramatically affect the overall 
5 targeting efficiency of cssDNA probes to linear targets and that the deletions have relatively the same 
effect on the hybridization on either IP527 and IP407. However when the size of the deletion is 
increased to 59 bp (1 1 % of the length of IP527). the relative targeting efficiency of probes IP527 and 
IP407 drops, to 61% and 33%. respectively. Furthermore the anwunt of the difference between the 
targeting efficiency mediated by these probes continues to increase linearly as the size of the deletion 
10 increases (Figure 15D). This indicates that when the size of the deletion is >10% of the length of the 
probe the efficiency of RecA-mediated DNA targeting is governed by the amount of homology between 
the cssDNA probe and target, while deletions <10% of the length of the probe are well tolerated for 
any length of cssDNA probe. Similar effects are observed with smaller cssDNA probes IP452. IP290 
(data not shown) and IP215 (Figure 16). 

15 HeterolQQQUs insertions and deletions are similarlv tole rated in the hvbridization of CSSPNA probes to 
finpar dfiPNA taroets. Other studies by Bianchi and Redding (Cell 35:51 1-520 (1983)) in which RecA- 
coated circular ssDNA was hybridized to linear duplex targets demonstrated that heterologous Inserts 
in the ssDNA were tolerated somewhat better than when the Insert was in the dsDNA. presumably 
because the inserts in ssDNA could be folded out of the way. In contrast Morel et al (J. Biol. Chem. 

20 269:19830 (1994)) used somewhat similar substrates and demonstrated that RecA-mediated strand 
exchange could bypass heterologies with equal efficiency whether the insert was in the ssDNA or 
dsDNA. Since the formaUon of stable cssDNA:probe target hybrids with internal sequences in linear 
dsDNA requires two cssDNA probe strands, we compared the effects of insertions in the cssDNA 
probe with having the same sized insertion in the dsDNA to determine how these intemal heterologies 

25 maybe accommodated within a four strand containing double-D-loop DNA structure. 

In these studies we compared the effects of 4 to 69 bp insertions In either the dsDNA target or 
cssDNA probe (deletion in target) using cssDNA probes ranging in size from 156 bp to 215 bp. We 
used this smaller cssDNA probe to maximize the effects of the insertion or deletion of these sizes. 
We prepared cssDNA probe IP215 from pRD.O using PCR and targeted pRD.O, pRD.4, pRD.25, and 
30 pRD.59 to measure the effects of insertions in cssDNA probes (target DNA deletion). Then using the 
same PCR primer set, we prepared cssDNA probes from templates pRD.O, pRD.4, pRD.25, and 
pRD.59 and then targeted pRD.O to measure the effects of deletions in cssDNA (target DNA insertion). 
Figure 16 shows that both deletions and insertions of the same size have exactly the same effect on 
RecA-mediated cssDNA targeting and are equally tolerated and stable. 
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t-arqg deletion? jn linggr DNA are tolerated in cssDNA nrnhp tara«.t hvhrirtc with 1^ .^, h^^k,^ t„ 
further define the extents of heterology that can be tolerated during cssDNA hybridization, we studied 
the effect of very large deletions, up to 448-967 bp on the targeting efficiency using a 1246 bp 
CSSDNA probe (IP1246) (Figure 17A) . With target deletions in the range of 500 bp (approx. 50% of 
5 the cssDNA probe length) there is only a slight reduction in the targeting efficiency achieved with this 
probe (80%). surprisingly the IP1246 can hybridize target DNA molecules bearing deletions up to 967 
bp at a detectable efficiency (27%). When IP1246 Is targeted to pRD.967. there are a total of 279 bp 
of homology between the cssDNA probe and target, with 147 bp 5' to the 967 bp insert and 132 bp 3' 
to the insert (Figure 17B). In order to account for such a high level of targeting efficiency with such a 
10 large deletion, we predict that the 967 bp insert in the two in the cssDNA probe strands, which are 
homologous to each other, may interact with each other to stabilize this hybrid. 

Furthermore when using a large cssDNA probes of 1246 bp we can observe a visible shift the 
migration of the cssDNA probertarget hybrid in comparison to the linear dsDN A target The positions 
of the migration of the of the 3.0 kb Seal-digested ds DNA mariner are shown in Figure 17A. Note the 
15 CSSDNA probertarget hybrids formed with IP1248 migrate slowerthan each of the Seal-digested 
targets, but that cssDNA probertarget hybrids fbmied with CP443. a smaller probe migrate closer the 
positions of the fomilll markers. The presence of this labelled slower-migrating species provides 
further evidence for the existence of the multi-stranded DNA hybrids. 

KCPR1 Restriction endori.iclf.asPn nit duplex DNA in Pith e r homolnnn..^ nr r^eterolnnn..^, r««nr^i^ 
20 probgitarqgthvMff To further characterize cssDNA probertarget hybrids formed with heterologous 
DNA targets, circular plasmids pRD.O and pRD.59 were hybridized with biotin-labelled probe IP290 
and then deprotelnized and digested with EcoRI. While plasmid pRD.O contains a unique EcoRI site 
in the region of homology between IP290 and pRD.O. the EcoRI site is deleted in pRD.59 (Figure 
14A). DigestionofcssDNAprobertargethybridswithEcoRI indicates the restoration of W&tson-Crick 
!5 pairing to fomi a fully duplex EcoRI recognition site. Figure 18 shows both the ethidium bromide 
stained gel of the hybrid product of the targeting reaction (Figure 18A and 18B) and the corresponding 
autoradiograph that shows the electrophoretic migration of the biotin-labelled probes (Figure 18C and 
18D). These data show that when RecA-coated IP290 is hybridized to the fully homologous pRD.O 
plasmW all of the probertarget hybrids migrate to the position of fully relaxed DNA (Figure 1 8 A and C. 
Lane 1). Furthermore, upon digestion with EcoRI cssDNArprobe target hybrids can be completely 
cut as shown In Figure 18 A and C. Lane 2. When similar reactions are performed with uncut pRD.59 
targets, we found that not all of the probertarget hybrids are relaxed as with pRD.O targets, as judged 
by the appearance of two bands corresponding to a pRD59 r hybrid, where the hybrids co-migrate 
with Form I supercoiled DNA and a pRD59 ri- hybrid that migrates with relaxed targets (Figure 18B 
and D. Lane 3). When these hybrids are digested with EcoRI we find that the pRD59 r\* hybrid is 



72 



wo 99/60108 



PCTAJS99/10731 



more susceptible to EcoRI cleavage than the pRD59 rl* hybrid (Figure 18B and D. Lane 4). This 
shows that there is a restoration of the EcoRl site in relaxed targets, but not in the non-relaxed I* 
hybrid. Since pRD59 targets do not contain an EcoRI site, cleavage by EcoRI can only be explained 
by re-annealing of cssDNA probe IP290 within the IP290 probe:target pRD59 hybrid. 

5 To further characterize the structural differences between pRD59 rl* hybrids and pRD59 1* hybrids. 
cssDNA probe:target hybrids were formed between IP290 and pRD59. deproteinized and thennally 
meHed for 5 mins at 37»C. 45»C. 55«C. and 66«C. respectively. Figure 19 shows that pRD59 ri* 
hybrids are more thermostable than pRD59 1* hybrids. For both types of hybrids probe.target hybrids 
are completely dissociated after heating to 95'C (data not shown). Taken together these date support 

1 0 the structures of our models for hybrids (Figure 1 3). 

EXAMPLE 6 

l^nmoloaous recombination targeting in fertilized mouse zygotes 

Ornithine transcarbamylase (OTC) is a mitochondrial matrix enzyme that catalyzes the synthesis of 
citruliine from omilhine and cartiamylphosphate in the second step of the mammalian urea cycle. 

1 5 OTC deficiency in humans is the most common and severe defect of the urea cycle disorders. OTC is 
an X-linked gene that is primarily expressed In the liver and to a lesser extent in the small intestine. 
Affected males develop hyperammonemia, acidosis, orotic aciduria, coma and death occurs in up to 
75% of affected males, regardless of intervention. Two allelic mutations at the OTC tocus are known In 
mice: spf and spf-ash, (sparse fur-abnormal skin and hair). In addition to hyperammonemia and 

20 orotic aciduria, spf-ash mice can be readily identified by the abnormal skin and hair phenotype. The 
spf-ash mutation is a single-base substitution at the end of exon 4 that results in alternative intron- 
exon splicing to produce an aberrant non-functional elongated pre-mRNA. Because of the clinical 
importance of OTC defects in humans, there is an intensive effort to develop in vivo nr»ethods to 
correct the enzymatic defect in the spf-ash mouse nwdel. 

25 V\te used the murine spf-ash model of OTC deficiency to test the ability of RecA-coated 

complementary single-stranded DNA (ess) OTC probes to target and correct a single-base substitution 
mutetton In fertilized mouse zygotes. A 230 bp RecA-coated cssDNA probe amplified from the nomial 
mouse OTC gene was microinjected Into embryos derived from matins of B6C3H homozygous spf-ash 
female with nomial B6D2F1 J males. After re-implantation of 75 embryos that were microinjected with 

30 RecA-cpated cssDNA into CD1 foster mothers. 25 developmentally normal pups (17 female and 8 
male) were bom. Sequence analysis of the genomic DNA isolated from tails of the male pups show 
that 3 out of 8 males were mosaic for a homologous recombinatron event at the spf-ash site in exon4 
of the mouse OTC gene. Subsequent breeding of the three founder males with normal females 
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resulted in normal female F, progeny, thus demonstrataing gemiline transmission of the homologous 
recombinant allele as well as phenotypic correction in F, animals. These homologoous recombinant 
changes were stable in and subsequent generations. These studies illustrate cssDNA mediated 
high frequency homologous recombination in liertilized mouse zygotes to create subtle genetic 
nfwdifications at a desired target site in the chromosome. 

PreparatlQp gf RggA^tftd prohft' A 230 bp fragnrtent from the nonnal mouse OTC gene was 
amplified by PGR with primers M9 and M8 from pTAOTC (Figure 20). The PGR fragment was 
purified on Microcon-100 columns (Amicon) and then extensively dialyzed. The M9-M8 amplicon was 
denatured by heating the fragments to 98«C and then coated with RecA protein (Boehringer- 
Mannheim) at a ratio 3 nucleotides/ protein monomer. The final concentration of RecA-coated DNA in 
coating buffer (5 mM TrisOAc. pH 7.5. 0.5 mM DTT. 10 mM MgOAc. 1.22 mM ATP(S. 5.5 [tM RecA) 
was 5 ngl \iL. RecA-coated filaments were made on the day of microinjection and then stored on Ice 
until use. 



Trgn?qeniclVlice : Five superovulated B6C3H (spf-ash/spf-ash) 5-7 week old females (Jackson Labs) 
15 were mated with five B6D2F1 males (Jackson Labs). Approximately 80-100 embryos were isolated 
from oviducts as described in Hogan et al. (1988). The female pronucleus of fertilized embryos was 
microinjected with 1-2 pi of RecA-coated M9-M8 cssDNA probe (5 ng/pL). Approximately 75 embryos 
survived the microinjection procedure and were then re-implanted into a total of three GDI 
pseudopregnant foster mothers (Charles River). Pseudopregnant females were produced by mating 
20 foster mothers with vasectomized GDI males (Gharles River). 

PNA Analysis: Tail biopsies were taken firom all founder mice after weaning at three weeks of age. 
Genomic DNA was isolated from tail bk>psies using standard procedures. To obtain the sequence of 
the DNA at the OTC tocus. genomic DNA was amplified with PCR using primers M10-M11 or MSA- 
MI 1 that flank the cssDNA probe sequence to generate a 250 bp or 314 bp amplicon (Figure 20). 
25 PCR fragments were sequenced manually using the Cyclist Exa Kit (Stratagene), automattoally on an 

Applied Biosystems Model 373A sequencer, or by a MALDI-TOF mass spectrometry system 
(GeneTrace Systems. Menio Park, CA) 

Fgrtilfeed zygotes miCfQiniected with RecA-materi nKf /^ y ioK,^ piasmid pTAOTCI carries a 250 
bp segment of exon4 and surrounding intron sequences from the normal mouse OTC gene. A 230 bp 
30 CSSDNA probe OTC1 was prepared by PGR amplification of pTAOTCI with primers M9 and M8. 
cssDNA probe 0TC1 was denatured and coated with RecA protein as described herein. 
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Homozygous spf^h/spf-ash female and hemizygous (spf-ash/y) males can be phenotypically 
identified by the appearance of sparse fur and wrinkled skin early in development. A cross between 
homozygous spf-ash/spf-ash B6C3H females and normal B6D2F1 males yields heterozygous 
phenotypically normal females and hemizygous males with sparse fur and wrinkled skin. The RecA- 
coated cssDNA OTC probe was microinjected into embryos made from the cross of B6C3H 
homozygous female spf-ash and normal males. The ftemale pronucleus of approximately 80-90 
fertilized zygotes was microinjected with 2 pi of a Sng/pL solution of RecA-coated cssDNA probe 
0TC1. Of these, 75 embryos survived the microinjectton procedure. To demonstrate that embryos 
that have been mteroinjected with RecA-coated cssDNA are viable, the embryos were re-implanted 
into three pseudopregnant CD1 foster mothers. From this. 25 developmentally norrnal pups (17 
female and 8 male) were produced. All of the female mice were phenotypically normal. The eight 
male mtoe (mouse # 7. 14.16,17.22.23.24. and 25) were all affected with sparse-fur and wrinkled skin 
to various degrees. 

Rar ^/^-mated rssHNA probe OTC1 recombin p s with ths homolnnniis chromQsnmai CQpy (?f the OTC 
^^.^o in fPrtiiired mouse zygotes. To determine the genotypes of the 25 founder mice produced from 
microinjected embryos, genomic DNA was isolated from tail biopsies. Genomic DNA was amplified 
with either the primer set M10-M11 orM54-M11 to produce either a 250 bp or 314 bp amplicon. By 
using these primer sets that flank the 0TC1 probe, the DNA amplicon represents DNA from the 
endogenous OTC gene. PCR fragments from all of the eight mfce and several female mtee were 
sequenced to determine the base sequence at the spf-ash locus to determine If a nonnal allele (G) or 
a mutant allele (A) was present In the genomic DNA. Figure 21 shows sequencing gels of 
representative reactions. The panel on the left side shows the sequence of the homozygous spf-ash 
females that donated the eggs to produce the fertilized zygotes where only the mutant base A is 
present at the spf-ash locus, as expected. The sequence of female mouse #8 that should be 
heterozygous shows the presence of equal amounts of the bases G and A as expected. Male mice 7. 
14 (shown). 23. 24.and 25 all showed only the mutant base A at the spf-ash locus, however male mice 
16. 17. and 22 (shown) displayed both G (normal) and A (mutant) at the spf-ash k5cus. 

To eliminate the possibility of PCR artifacts during PCR cycle sequencing the base compositions of 
the samples was independently confirmed by mass spectrometry sequencing (GeneTrace, Menio 
Parte). The relative (%) amounts of the A:G base composition at the spf-ash locus was also quantified 
and determined to be 70%:30% for samples from mouse #16 and #17 and 10%:90% for mouse #22. 
Since OTC is an X-linked gene the presence of mixed bases in male mice is likely the result of the 
mosaic animals produced of a mixture of mutant and gene corrected embryonic cells. 
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Germline transmission of the gepe rnrrer^ OTC , ||o|n To determine if the gene corrected allele in 
the mosaic male founder mice (#-s 16. 17. and 22) could be passed through to the gemiline. these 
mice and a control hemizygous mutant male (#7) were bred with normal B6D2F1 females. In this 
cross, if the mate donates a mutant spf-ash X chromosome, then the resulting femate progeny will be 
heterozygous spf-ash mutants. However if the mate donates a normal (gene corrected) X 
Chromosome the female progeny will be homozygous nomal. In both cases the resulting F1 females 
willbephenotypicallynomwi. The results of these crosses are summarized in Figure 22. In the 

control cross of hemizygous mutant mate #7 with B6D2F1 females. all14 femate progeny were 
heterozygous, as expected. In test crosses of mosaic mate mouse #1 7 and #22 with nomial females 
all resultmg femate progeny (5 and 9. respectively) were heterozygous. However in the cross with 
mosaic mate mouse #16. one out nine total femate progeny was a homozygous nom«l femate (mouse 
# 213). as detemiined mass spectrometry DNA sequencing (GeneTrace. Menio Park), demonstrating 
the gene corrected allele in founder mouse #16 was tranmitted through the germline. 

To further verify that the F1 mouse #21 3 was, in fact, a germline-transmitted gene corrected 
homozygous normal femate. this mouse and a control heterozygous spf-ash/* mouse were bred with 
normal males. In the control cross B with the heterozygous female. 50% of the resulting male F2 
progeny should be mutant spf-ash/y hemizygotes that can be easily detem»ned by the visualization of 
the sparse-fur and wrinkled Skin phenotype. Of.the 38 pregeny produced in this control cress B 14 
were male, and of these. 8 were phenotypically nomial and 6 were mutant, as detem,ined by the 
presence of wrinkled skin and abnormal fur. In the test cross with F1 mouse #213. of the 35 progeny 
produced in this cross, all eleven of the mate progeny were phenotypically nomial. clearly showing «ie 
genotyping of F1 mouse #213 as a germline transmitted gene corrected homozygous normal female. 

As another independent test to determine if the nomial gene corrected allete in mouse #16 could be 
transmitted threugh the germline. mouse #16 was mated with homozygous (spf-ash/spf-ash) mutant 

25 females, in ttiis cross if mouse #16 does not transmit a normal allele, the resultant progeny will either 
be hemizygous (spf-ash/Y) mutant males or homozygous (spf-ash/spf-ash) mutant femates both of 
Which are phenotypically mutant. However if the mouse allete is transmitted through the gemiline 
heterozygous (spf-ash/.) females that are phenotypically nomial will be produced. When mouse #16 
was bred with homozygous (spf-ash/spf-ash) mutant femates. two litters were produced that consisted 

30 of a total of 5 hem^ygous (spf-ashA') mutent mates. 7 homozygous (spf-ash/spf-ash) mutant females 
and 1 phenotypically nomial femate (mouse #1 014). Pictures of representative mice from these 
crosses are shown in Figure 23. The production of the phenotypically normal femate mouse provides 
direct genetic evidence that mouse #16 contains a nomal gene corrected OTC allete that is gemiline 
transmissable. 
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Although the present invention has been described in some detail by way of illustration for purposes of 
clarity of understanding, it will be apparent that certain changes and modifications may be practiced 
within the scope of the claims. 



77 



"^ossmm Pcr/us99/i073i 



CLAIMS 

We claim: 



1. A non-human mammal comprising a modified endogenous gene, wherein said endogenous 

gene is selected from the group consisting of a gene or sequence encoding an ion-channel, a G- 

5 protein coupled receptor (GPCR). an immunoglobulin, a growth factor, an erjzyme. or a milk 
protein. 

2. A mammal according to claim 1 wherein said mammal is a farm animal. 

3. A mammal according to claim 2 wherein said fann animal is selected from the group 
consisting of cattle, sheep, pigs, horses and goats. 

10 4. A mammal according to claim 1 wherein said mammal is selected from the group consisting 
of mice. rats, rabbits , guinea pigs, hamsters and gerbils. 

5. A mammal according to claim 1 wherein said milk protein gene is a lactoglobulin gene. 

6. A mammal according to claim 5 wherein said lactoglobulin gene is the "-lactogtobulin gene or 
the $-lactoglobulin gene. 

15 7. A mammal according to claim 6 wherein said modified "-lactoglobulin gene or iMactogiobulin 
gene does not encode any phenylalanine residues. 

8. A mammal according to claim 1 wherein said endogenous gene is disrupted by deletion of at 
least one nucleotide. 

9. A mammal according to claim 1 wherein said endogenous gene is disrupted by an Insertton 
20 sequence. 

10. A mammal according to claim 9 wherein said insertion sequence is a pojyiinker sequence. 

1 1. A mammal according to claim 9 wherein said insertion sequence is a reporter gene. 

12. A mammal according to claim 1 1 wherein said reporter gene is selected from the group 
consisting of a luciferase gene, a p-galactosidase gene and green fluorescent protein (GFP). 
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blue fluorescent protein (BFP). red fluorescent protein (RFP) and yellow fluorescent protein 
(YFP). 

13. A mammal according to claim 9 wherein said insertion sequence is selected from the group 
consisting of a gene encoding human lysozyme, human growth hormone, human serum 

5 albumin, human globin, a human immunoglobulin, and a human enzyme. 

14. A mammal according to claim 12 wherein said human enzyme is a-1 antitrypsin. 

15. A mammal according to claim 12 wherein said human enzyme is anti-thrombin III. 

16. A mammal according to claim 12 wherein said human enzyme gene does not encode any 
phenylalanine residues. 

10 17. A mammal according to claim 9 wherein said insertion sequence is selected from the group 
consisting of a human gene under control of its endogenous promoter, a modified endogenous 
regulatory element for an endogenous gene, a transcriptional regulation cassette and a 
dimerizing sequence. 

18. A mammal according to claim 17 wherein said endogenous regulatory element is disrupted 
IS by deletion of at least one nucleotide. 

19. A mammal according to claim 17 wherein said regulatory element is disrupted by an 
insertion sequence. 

20. A mammal according to claim 1 wherein said enzyme is a sugar transferase enzyme. 

21. A mammal according to claim 20 wherein said sugar transferase enzyme is "-galactosyl 
20 transferase. 

22. A mammal according to claim 21 wherein said "-galactosyl transferase gene is disrupted by 
deletion of at least one nucleotide. 

23. A mammal according to claim 21 wherein said "-galactosyl transferase gene is disrupted by 
an insertion sequence. 
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24. A mammal according to claim 23 wherein said insertion sequence is a hormone receptor 
gene. 

25. A mammal according to claim 23 wherein said insertion sequence is a viral receptor gene. 

26. A mammal according to claim 23 wherein said insertion sequence is a G-protein coupled 
receptor gene. 

27. A primate comprising a modified endogenous gene. 

28. A primate according to claim 27 wherein said endogenous gene is disrupted by deletion of 
at least one nucleotide. 

29. A primate according to claim 27 wherein said endogenous gene is disrupted by an insertion 
sequence. 

30. A primate according to claim 29 wherein said insertion sequence is a human therapeutic 
gene. 

31 . A primate according to claim 29 wherein said insertion sequence is a human antibody gene. 
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